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SEQUENCE LISTING 



(1) GENERAL INFORMATION 



(i) APPLICANT: Hadlac/.ky, Gyula 
Szalay, Aladar 



(11) TITLE OF THE INVENTION: ARTIFICIAL CHROMOSOMES, USES THEREOF 



AND METHODS FOR PREPARING ARTIFICIAL CHROMOSOMES 



(iii) NUMBER OF SEQUENCES: 3 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Heller Ehrman White & McAuliffe 

(B) STREET : 4250 Executive Square , 7th Floor 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 28-NOV-2000 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 0 8/83^, ?, . 

(B) FILING DATE: 10-APR-1997 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/695,191 

(B) FILING DATE: 07-AUG-1996 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/682,080 

(B) FILING DATE: 15-JUL-1996 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/629,822 

(B) FILING DATE: 10-APR-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Seidman, Stephanie L 

(B) REGISTRATION NUMBER: 33,779 

(C) REFERENCE/DOCKET NUMBER : 24601402F 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 858-450-8403 

(B) TELEFAX: 858-587-5360 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO : 1 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 12 93 base pairs 

(B) TYPE: nucleic acid 

(C) S T RAT J D E D N E S S : single 

( D) TOPOLOGY : 1 1 near 

(ii) MOLECULE TYPE: Genomic DNA 
( l i i ) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE : 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GAATTCATCA TTTTTCANGT CCTCAAGTGG ATGTTTCTCA TTTNCCATGA TTTTAAGTTT 6 0 

TCTCGCCATA TTCCTGGTCC TACAGTGTGC ATTTCTCCAT TTTNCACGTT TTNCAGTGAT 12 0 

TTCGTCATTT TCAAGTCCTC AAGTGGATGT TTCTCATTTN CCATGAATTT CAGTTTTCTN 180 

GCCATATTCC ACGTCCTACA GNGGACATTT CTAAATTTNC CACCTTTTTC AGTTTTCCTC 24 0 

GCCATATTTC ACGTCCTAAA ATGTGTATTT CTCGTTTNCC GTGATTTTCA GTTTTCTCGC 3 00 

CAGATTCCAG GTCCTATAAT GTGCATTTCT CATTTNNCAC GTTTTTCAGT GATTTCGTCA 360 

TTTTTTCAAG TCGGCAAGTG GATGTTTCTC ATTTNCCATG ATTTNCAGTT TTCTTGNAAT 42 0 

ATTCCATGTC CTACAATGAT CATTTTTAAT TTTCCACCTT TTCATTTTTC CACGCCATAT 4 80 

TTCATGTCCT AAAGTGTATA TTTCTCCTTT TCCGCGATTT TCAGTTTTCT CGCCATATTC 54 0 

CAGGTCCTAC AGTGTGCATT CCTCATTTTT CACCTTTTTC ACTGATTTCG TCATTTTTCA 600 

AGTCGTCAAC TGGATCTTTC TAATTTTCCA TGATTTTCAG TTATCTTGTC ATATTCCATG 660 

TCCTACAGTG GACATTTCTA AATTTTCCAA CTTTTTCAAT TTTTCTCGAC ATATTTGACG 72 0 

TGCTAAAGTG TGTATTTCTT ATTTTCCGTG ATTTTCAGTT TTCTCGCCAT ATTCCAGGTC 780 

CTAATAGTGT GCATTTCTCA TTTTTCACGT TTTTCAGTGA TTTCGTCATT TTTTCCAGTT 84 0 

GTCAAGGGGA TGTTTCTCAT TTTCCATGAG TGTCAGTTTT CTTGCTATAT TCCATGTCCT 90 0 

ACAGTGACAT TTCTAAATAT TATACCTTTT TCAGTTTTTC TC AC CAT ATT TCACGTCCTA 9o0 

AAGTATATAT TTCTCATTTT CCCTGATTTT CAGTTTCCTT GCCATATTCC AGGTCCTACA 10 JO 

GTGTGCATTT CTCATTTTTC ACGTTTTTCA GTAATTTCTT CATTTTTTAA GCCCTCAAAT 1000 

GGATGTTTCT CATTTTCCAT GATTTTCAGT TTTCTTGCCA TATACCATGT CCTACAGTGG . \< 

ACATTTCTAA ATTATCCACC TTTTTCAGTT TTTCATCGGC AC AT T T C AC G TCCTAAAGTG 12 0 0 

TGTATTTCTA ATTTTCAGTG ATTTTCAGTT TTCTCGCCAT ATTCCAGGAC CTACAGTGTG 12 6 0 

CATTTCTCAT TTTTCACGTT TTTCAGTGAA TTC 12 9 3 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1044 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

AGGCCTATGG TGAAAAAGGA AATATCTTCC CCTGAAAACT AGACAGAAGG ATTCTCAGAA 6 0 

TCTTATTTGT GATGTGCGCC CCTCAACTAA CAGTGTTGAA GCTTTCTTTT GATAGAGCAG 12 0 

TTTTGAAACA CTCTTTTTGT 7AAAATCTGCA AGAGGATATT TGGATAGCTT TGAGGATTTC 18 0 

CGTTGGAAAC GGGATTGTCT TCATATAAAC C C TAG AC AG A AGCATTCTCA GAAGCTTCAT 24 0 

TGGGATGTTT CAGTTG7AAGT CACAGTGTTG AACAGTCCCC TTTCATAGAG CAGGTTTGAA 30 0 

ACACTCTTTT TTGTAGTATC TGGAAGTGGA CATTTGGAGC GATCTCAGGA CTGCGGTGAA 36 0 

AAAGGAAATA TCTTCCAATA AAAGCTAGAT AGAGGCAATG TCAGAAACCT TTTTCATGAT 42 0 

GTATCTACTC AGCTAACAGA GTTGAACCTT CCTTTGAGAG AGCAGTTTTG AAACACTCTT 4 80 

TTTGTGGAAT CTGCAAGTGG ATATTTGTCT AGCTTTGAGG ATTTCGTTGG GAAACGGGAT 54 0 

TACATATAAA AAGCAGACAG CAGCATTCCC AGAAACTTCT TTGTGATGTT TGCATTCAAG 600 

TCACAGAGTT GAACATTCCC TTTCATAGAG CAGGTTTGAA ACACACTTTT TGATGTATCT 660 
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GGATGTGGAC 
AAGTAGACAc ] 
GAAGCTTTCT 
ATTTGTGTAG 



ATTTGCAGCG 
AAGCATTCTC 



CTTTCAGGCC 
AGAAACTTAT 



TTTGATAGAG GCAGTTTTGA 



TAAGGTGAAA 
TTGTGATGTG 
AACACTCTTT 



CTTTGAGGAT TTCTTTGGAA ACGGGATTAC 



AGGAAATATC 
CGCCCTCAAC 
TGTGGAATCT 
ATATAAAAAG 



CATTCCCAGA ATCTTGTTTG TGATGTTTGC ATTCAAGTCA CAGAGTTGAA 



TTCCCCTGAA 
TAACAGTGTT 

gcaagtggat 
cagacagcag 
cattgccttt 



cagagagcag gtttgaacac tctttttata gtatctggat gtggac/ott ggagcgcttt 
caggggggat cctctagaat tcct 



72 0 
780 
84 o 
9 0 (J 
9 b 0 
1020 
104 4 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2492 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
( ix) FEATURE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CTGCAGCTGG GGGTCTCCAA TCAGGCAGGG GCCCCTTACT ACTCAGATGG GGTGGCCGAG 0 0 

TAGGGGAAGG GGGTGCAGGC TGCATGAGTG GACACAGCTG TAGGACTACC TGGGGGCTGT 12 0 

GGATCTATGG GGGTGGGGAG AAGCCCAGTG ACAGTGCCTA GAAGAGACAA GGTGGCCTGA 10' 

GAGGGTCTGA GGAACATAGA GCTGGCCATG TTGGGGCCAG GTCTCAAGCA GGAAGTGAGG 2 10 

AATGGGACAG GCTTGAGGAT ACTCTACTCA GTAGCCAGGA TAGCAAGGAG GGCTTGGGGT 'o- 

TGCTATCCTG GGGTTCAACC CCCCAGGTTG AAGGCCCTGG GGGAGATGGT C C C AGG AC AT 

ATTACAATGG ACACAGGAGG TTGGGACACC TGGAGTCACC AAACAAAACC ATGCCAAGAG 42 0 

AGACCATGAG TAGGGGTGTC CAGTCCAGCC CTCTGACTGA GCTGCATTGT TCAAATCCAA 4 80 

AGGGCCCCTG CTGCCACCTA GTGGCTGATG GCATCCACAT GACCCTGGGC CACACGCGTT 04 0 

TAGGGTCTCT GTGAAGACCA AGATCCTTGT TACATTGAAC GACTCCTAAA TGAGCAGAGA 600 

TTTCCACCTA TTCGAAACAA TCACATAAAA TCCATCCTGG AAAAAGCCTG GGGGATGGCA 0>,0 

CTAAGGCTAG GGATAGGGTG GGATGAAGAT TATAGTTACA GTAAGGGGTT TAGGGTTAGG 72 0 

GATCAACGTT GGTTAGGAGT TAGGGATACA GTAGGGTACC GGTAGGGTTA GGGGTTAGGG 780 

TTAGGGGTTA GGGTTAGGGT TAGGGTTAGG GTTAGGGTTA GGGGTTAGGG GTTAGGGTTA 84 0 

GGGTTAGGTT TTGGGGTGGC GTATTTTGGT CTTATACGCT GTGTTCCACT GGCAATGAAA 900 

AGAGTTCTTG TTTTTCCTTC AGCAATTTGT CATTTTTAAA AGAGTTTAGC AATTCTAACA 96 0 

GAT AT AG AC C AGCTGTGCTA TCTCATTGTG GTTTTCAATT GTAACCACAT TGTGGTTTCA 102 0 

ATGTGTTTAC TTGCCATCTG TAGATCTTCT TTGCGTGAGG TGTCTGTTCA GATGTGTGTG 10 8 0 

CATTTCTTGN NTTTNGGCTG TTTAACTTAT TGTTTAGTTT TAATAATTTT TTATATATTT 114 0 

GAAGACAAAT CTTTCTCAGA TGTGTATTTG CAAATATTTC TTCAATATGA GGCTTGCTTT 12 0 0 

TGTCTCTAAC AAGGTCTCTT CAGAGATAAC TTAAATATAA GAAATCCACA CTGTCACTTC 12 6 0 

TTTTGTGTAT ATCTACCTTT TGTGTCATTT GTTAAAATTC ATTACCAAAC CCAAAGGCAG 13 2 0 

ATAGCTTTTC TTCTATTGTT TCTTCTAGAA ATTTGTATAG TTTTGCATTT TTAGTGTAAG 13 80 

GATGATTTTG AGTGATTATT TGTGTAAGTT GTAAAGTTTT CGTCTATATC CATATCATTT 14 4 0 

CTTATGGTTT CCAATTAATC GTTCCCTCAC TATTTTTGGG AAAGACACAG GATAGTGGGC 150 0 

TTTGTTAGAG TAGATAGGTA GCTAGACATG AACAGGAGGG GGCCTCCTGG AAAAGGGAAA 156 0 

GTCTGGGAAG GCTCACCTGG AGGACCACCA AAAATTCACA TATTAGTAGC ATCTCTAGTG 16 2 0 

CTGGAGTGGA TGGGCACTTG TCAATTGTGG GTAGGAGGGA AAAGAGGTCC TATGCAGAAA 16 8 0 

GAAACTCCCT AGAACTCCTC TGAAGATGCC CCAATCATTC ACTCTGCAAT AAAAATGTCA 174 0 

GAATATTGCT AGCTACATGC TGATAAGGNN AAAGGGGACA TTCTTAAGTG AAACCTGGCA 1800 

C C AT AAGT AC AGATTAGGGC AGAGAAGGAC ATTCAAAAGA GGCAGGCGCA GTAGGTACAA 186 0 

ACGTGATCGC TGTCAGTGTG CCTGGGATGG CGGGAAGGAG GCTGGTGCCA GAGTGGATTC 192 0 

GTATTGATCA CCACACATAT ACCTCAACCA ACAGTGAGGA GGTCCCACAA GCCTAAGTGG 1980 

GGCAAGTTGG GGAGCTAAGG CAGTAGCAGG AAAACCAGAC AAAGAAAACA GGTGGAGACT 2 04 0 

TGAGACAGAG GCAGGAATGT GAAGAAATCC AAAATAAAAT TCCCTGCACA GGACTCTTAG 210 0 

GCTGTTTAAT GCATCGCTCA GTCCCACTCC TCCCTATTTT TCTACAATAA ACTCTTTACA 2160 

CTGTGTTTCT TTTCAATGAA GTTATCTGCC ATCTTTGTAT TGCCTCTTGG TGAAAATGTT 222 0 

TCTTCCAAGT TAAACAAGAA CTGGGACATC AGCTCTCCCC AGTAATAGCT CCGTTTCAGT 22 80 
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TTGAATTTAC AGAACTGATG GGCTTAATAA CTGGCGCTCT GACTTTAGTG C JTGCAGGAGG 1-M4 0 
CCGTCACAC C 1 G ' 3 G A 0 1 3 A AC.; A GTGCGGTGGG T ACT C ( ' C 1 r A ' V CTGC 1 GGG( 'AG, G T G G C G G C T G ! ! 4 0 0 



(2) INFORMATION FOR SEQ ID NO : 4 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 8 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS : s ingle 
(Di TOPOLOGY : linear 

(ii) MOLECULE TYPE : Genomic DNA 

(iii ) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GGGGAATTCA TTGGGATGTT TCAGTTGA 2 8 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic LNA 
( iii J HYPOTHET I CAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CGAAAGTCCC CCCTAGGAGA TCTTAAGGA 2 9 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: RNA 

(iii) HYPOTHETI CAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CCGCTTAATA CTCTGATGAG TCCGTGAGGA CGAAACGCTC TCGCACC 



CCTCGACACT GACAGCAATA 
ACGACTACAC TGTGAGCAAG 



GGGTCCGGCA GTGT' '( 'CAc ■ Gl't^C 'GAGC 'AG ( K JGt Jf 'XYVACXl 
AGGGC«2CTGC AG 




(2) INFORMATION FOR SEQ ID NO : 7 : 
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( i ) SEQUENCE CHARACTER ISTI CS : 
(A) LENGTH : 25 base pairn 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(li J MOLECULE TYPE: Genomic DNA 
(in) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CGATTTAAAT TAATTAAGCC CGGGC 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
TAAATTTAAT TAATTCGGGC CCGTCGA 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(D) OTHER INFORMATION IL-2 signal sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATG TAG AGG ATG CAA CTC CTG TCT TGC ATT GCA CTA AGT CTT GCA CTT 

Met Tyr Arg Met Gin Leu Leu Ser Cys He Ala Leu Ser leu Aia leu 8 

GTC ACA AAC AGT GCA CCT ACT 

Val Thr Asn Ser Ala Pro Thr 69 



2 7 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 
( ix) FEATURE : 

(A) NAME/KEY: Coding .Sequence 

(B) LOCATION: 1 . . . 942 

(D) OTHER INFORMATION: Renilla Reinformis Lucif erase 

(x) PUBLICATION INFORMATION: 

PATENT NO.: 5,418,155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AGC TTA AAG ATG ACT TCG AAA GTT TAT GAT CCA GAA CAA AGG AAA CGG 4 8 

Ser Leu Lys Met Thr Ser Lys Val Tyr Asp Pro Glu Gin Arg Lys Arg 
15 10 15 

ATG ATA ACT GGT CCG CAG TGG TGG GCC AGA TGT AAA CAA ATG AAT GTT 9 6 

Met lie Thr Gly Pro Gin Trp Trp Ala Arg Cys Lys Gin Met Asn Val 
20 25 30 

CTT GAT TCA TTT ATT AAT TAT TAT GAT TCA GAA AAA CAT GCA GAA AAT 14 4 

Leu Asp Ser Phe lie Asn Tyr Tyr Asp Ser Glu Lys His Ala Glu Asn 
35 40 45 

GCT GTT ATT TTT TTA CAT GGT AAC GCG GCC TCT TCT TAT TTA TGG CGA 192 
Ala Val lie Phe Leu His Gly Asn Ala Ala Ser Ser Tyr Leu Trp Arg 
50 55 60 

CAT GTT GTG CCA CAT ATT GAG CCA GTA GCG CGG TGT ATT ATA CCA GAT 
His Val Val Pro His lie Glu Pro Val Ala Arg Cys lie He Pro Asp 
65 70 75 80 

CTT ATT GGT ATG GGC AAA TCA GGC AAA TCT GGT AAT GGT TCT TAT AGG 288 
Leu He Gly Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg 
85 90 95 

TTA CTT GAT CAT TAC AAA TAT CTT ACT GCA TGG TTG AAC TTC TTA ATT 336 
Leu Leu Asp His Tyr Lys Tyr Leu Thr Ala Trp Leu Asn Phe Leu He 
100 105 110 

TAC CAA AGA AGA TCA TTT TTT GTC GGC CAT GAT TGG GGT GCT TGT TTG 3 84 

Tyr Gin Arg Arg Ser Phe Phe Val Gly His Asp Trp Gly Ala Cys Leu 
115 120 125 

GCA TTT CAT TAT AGC TAT GAG CAT CAA GAT AAG ATC AAA GCA ATA GTT 4 32 

Ala Phe His Tyr Ser Tyr Glu His Gin Asp Lys He Lys Ala He Val 
130 135 140 

CAC GCT GAA AGT GTA GTA GAT GTG ATT GAA TCA TGG GAT GAA TGG CCT 4 80 

His Ala Glu Ser Val Val Asp Val He Glu Ser Trp Asp Glu Trp Pro 
145 150 155 160 

GAT ATT GAA GAA GAT ATT GCG TTG ATC AAA TCT GAA GAA GGA GAA AAA 52 8 

Asp lie Glu Glu Asp He Ala Leu He Lys Ser Glu Glu Gly Glu Lys 
165 170 175 

ATG GTT TTG GAG AAT AAC TTC TTC GTG GAA ACC ATG TTG CCA TCA AAA 576 
Met Val Leu Glu Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys 
180 185 190 

ATC ATG AGA AAG TTA GAA CCA GAA GAA TTT GCA GCA TAT CTT GAA CCA 624 
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He Met Arg Lys Leu Glu Pro Glu Glu Phe Ala Ala Tyr Leu Glu Pre 
l 20 0 2 0 l ; 

TTC AAA GAG AAA GGT GAA GTT CGT CGT CCA ACA TTA TCA TGG CCT CGT 67 2 

Phe Lys Glu Lys Gly Glu Val Arg Arg Pro Thr Leu Ser Trp Pro Arg 
210 215 220 

GAA ATC CCG TTA GTA AAA GGT GGT AAA CCT GAC GTT GTA CAA ATT GTT 72 0 

Glu He Pro Leu Val Lys Gly Gly Lys Pro Asp Val Val Gin He Val 
225 230 235 240 

AGG AAT TAT AAT GCT TAT CTA CGT GCA AGT GAT GAT TTA CCA AAA ATG 76 8 

Arg Asn Tyr Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lvs Met 
245 250 255 

TTT ATT GAA TCG GAT CCA GGA TTC TTT TCC AAT GCT ATT GTT GAA GGC 816 
Phe He Glu Ser Asp Pro Gly Phe Phe Ser Asn Ala He Val Glu Gly 
260 265 270 

GCC AAG AAG TTT CCT AAT ACT GAA TTT GTC AAA GTA AAA GGT CTT CAT 864 
Ala Lys Lys Phe Pro Asn Thr Glu Phe Val Lys Val Lys Gly Leu His 
275 280 285 

TTT TCG CAA GAA GAT GCA CCT GAT GAA ATG GGA AAA TAT ATC AAA TCG 912 
Phe Ser Gin Glu Asp Ala Pro Asp Glu Met Gly Lys Tyr He Lys Ser 
290 295 300 

TTC GTT GAG CGA GTT CTC AAA AAT GAA CAA TAA 94 5 

Phe Val Glu Arg Val Leu Lys Asn Glu Gin 
305 310 



(2) INFORMATION FOR SEQ ID NO ill: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 

( iii ) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTTGAATTC A TGTACAGGAT GCAACTCCTG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 2 : 
TTTGAATTCA GTAGGTGCAC TGTTTGTCAC 

(2) INFORMATION FOR SEQ ID NO : 1 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1434 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCTCCACGCA CGTTGTGATA TGTAGATGAT AATCATTATC AGAGCAGCGT TGGGGGATAA 6 0 

TGTCGACATT TCCACTCCCA ATGACGGTGA TGTATAATGC TCAAGTATTC TCCTGCTTTT 1^0 

TTACCACTAA CTAGGAACTG GGTTTGGCCT TAATTCAGAC AGCCTTGGCT CTGTCTGGAC 180 

AGGTCCAGAC GACTGACACC ATTAACACTT TGTCAGCCTC AGTGACTACA GTCATAGATG ^40 

AACAGGCCTC AGCTAATGTC AAGATACAGA GAGGTCTCAT GCTGGTTAAT CAACTCATAG 3 00 

ATCTTGTCCA GATACAACTA GATGTATTAT GACAAATAAC TCAGCAGGGA TGTGAACAAA 3 60 

AGTTTCCGGG ATTGTGTGTT ATTTCCATTC AGTATGTTAA ATTTACTAGG ACAGCTAATT 420 

TGT C AAAAAG TCTTTTTCAG TATATGTTAC AGAATTGGAT GGCTGAATTT GAACAGATCC 4 80 

TTCGGGAATT GAGACTTCAG GTCAACTCCA CGCGCTTGGA CCTGTCGCTG ACCAAAGGAT 54 0 

TACCCAATTG GATCTCCTCA GCATTTTCTT TCTTTAAAAA ATGGGTGGGA TTAATATTAT 6 00 

TTGGAGATAC ACTTTGCTGT GGATTAGTGT TGCTTCTTTG ATTGGTCTGT AAGCTTAAGG ^60 

C C C AAAC TAG GAGAGACAAG GTGGTTATTG CCCAGGCGCT TGCAGGACTA GAACATGGAG V'O 

CTTCCCCTGA TATATGGTTA TCTATGCTTA GGCAATAGGT CGCTGGCCAC TCAGCTCTTA -/yfj 

TATCCCACGA GGCTAGTCTC AT TGT ACGGG ATAGAGTGAG TGTGCTTCAG CAGCCCGAGA 84 0 

GAGTTGCAAG GCTAAGCACT GCAATGGAAA GGCTCTGCGG CATATATGTG CCTATTCTAG 90 0 

GGGGACATGT CATCTTTCAT GAAGGTTCAG TGTCCTAGTT CCCTTCCCCC AGGCAAAACG </«;<) 

ACACGGGAGC AGGTCAGGGT TGCTCTGGGT AAAAGCCTGT GAGCCTGGGA GCTAATCCTG 1CPG 

TACATGGCTC CTTTACCTAC ACACTGGGGA TTTGACCTCT ATCTCCACTC TCATTAATAT 10 8') 

GGGTGGC CTA TTTGCTCTTA TTAAAAGGAA AGGGGGAGAT GTTGGGAGCC GCGCCCACAT 114 0 

TCGCCGTTAC AAGATGGCGC TGACAGCTGT GTTCTAAGTG GTAAACAAAT AATCTGCGCA 12 0 0 

TGTGCCGAGG GTGGTTCTTC ACTCCATGTG CTCTGCCTTC CCCGTGACGT CAACTCGGCC 12 6 0 

GATGGGCTGC AGCCAATCAG GGAGTGACAC GTCCTAGGCG AAGGAGAATT CTCCTTAATA 132 0 

GGGACGGGGT TTCGTTCTCT CTCTCTCTCT TGCTTCTCTC TCTTGCTTTT TCGCTCTCTT 13 80 

GCTTCCCGTA AAGTGATAAT GATTATCATC TACATATCAC AACGTGCGTG GAGG 14 34 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

CCTCCACGCA CGTTGTGATA TGTAGATGAT AATCATTATC AGAGCAGCGT TGGGGGATAA 6 0 

TGTCGACATT TCCACTCCCA ATGACGGTGA TGTATAATGC TCAAGTATTC TCCTGCTTTT 1^0 

TTACCACTAA CTAGGAACTG GGTTTGGCCT TAATTCAGAC AGCCTTGGCT CTGTCTGGAC 180 

AGGTCCAGAT ACAACTAGAT GTATTATGAC AAATAACTCA GCAGGGATGT GAACAAAAGT 24 0 
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TTCCGGGATT 
CAWAAGTCT 
GGGAATTGAG 
CCAATTGGAT 
GAGATACACT 
AAACTAGGAG 
CCCCTGATAT 
AGGCTAGTCT 
GGCTAAGCAC 
TCATCTTTCA 
CAGGTCAGGG 
CCTTTACCTA 
ATTTGCTCTT 
CAAGATGGCG 
GGTGGTTCTT 
CAGTCAATCA 
TTTCGTTTTC 
TGTAAGAATA 
TGAGAACGCG 
TATCACAACG 



GCGTGTTATT 
TTTCCAGTAT 
ACTTCAGGTC 
CTCCTCAGCA 
TTGCTGTGGA 
AGACAAGGTG 
ATCTATGCTT 
CATTGCACGG 
TGCAATGGAA 
AGAAGGTTGA 
TTGGTCTGGG 
CACACTGGGG 
ATTAAAAGGA 
CTGACAGCTG 
CACTCCATGT 
GGGAGTGACA 
TCTCTCTCTT 
AAGCTTTGCC 
TCTAATAACA 
TGCGTGGAGG 



TCCATCCAGT 
ATGTTACAGA 
AACTCCACGC 
TTTTCTTTCT 
TTAGTGTTGC 
GTTATTGCCC 
AGGCAATAGG 
GATAGAGTGA 
AGGCTCTGCG 
GTGTCCAAGT 
TAAAAGCCTG 
ATTTGACCTC 
AAGGGGGAGA 
TGTTCTAAGT 
GCTCTGCCTT 
CGTCCTAGGC 
GCTTCGCTCT 
GCAGAAGATT 
ATTGGTGCCG 



ATijTTAAATT 
ATTGGATGGC 
GCTTGGACCT 
TTAAAAAATG 
TTCTTTGATT 
AGGCGCTTGC 
TCGCTGGCCA 
GTGTGCTTCA 
GCATATATGA 
GTCCTTCCTC 
TGAGCGTAAG 
TATCTCCACT 
TGTTGGGAGC 
GGTAAACAAA 
CCCCGTGACG 
GAAGGAAAAT 
CTCTTGCTTC 
CTGGTCTGTG 
AAACCCGGGT 



TACTAGGGCA 
TGAATTTGAA 
GTCCCTGACC 
GGTGGGATTA 
GGTCTGTAAG 
AGGACTAGAA 
CTCAGCTCTT 
GCAGCCCGAG 
GCCTATTCTA 
CAGGCAAAAC 
AGCTAATCCT 
CTCATTAATA 
CGCGCCCACA 
TAATCTGCGC 
TCAACTCGGC 
TCTCCTTAAT 
TTGCTCTCTT 
GTGTTCTTCC 
GATAATGATT 



GCTAATTTGT 
CAGATCCTTf ' 
AAAGGATTAC 
ATATTATTTG 
CTTAAGGCCC 
CATGGAGCTT 
ATATCCCATG 
AGAGTTGCAG 
GGGAGACATG 
GACACGGGAG 
GTACATGGCT 
TGGGTGGCCT 
TTCGCCGTTA 
ATGCGCCGAG 
GGATGGGCTG 
AGGG AC GGGG 
TTCCTGAAGA 
TGGCCGGTCG 
ATCATCTACA 



3 f ) 0 
i> » C ) 

42 u 

4 H C > 

5 4 0 

6 ( > (J 

6 0 

7 JO 

7 8 0 

8 40 

9 0 0 
9 C 0 

1020 
1 0 8 0 
1140 

1 2 0 0 
1260 
132 0 

13 80 

1 4 ( J 0 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1369 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TO POLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
{ i i i ) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CCTCCACGCA 
ACAACTGCCA 
ACACATTACT 
ATTGTGTGAA 
TGGCATAATA 
ATACAAATAA 
AC C T AAT AAA 
AATTTATTTT 
ATGTTCAATT 
TCAAATATGT 
TTCCTTCGTT 
CTCTAACCAT 
ATGCCAGAGA 
GTTCCTTGGA 
CAGTGCCTCC 
GTGAACTGGT 
ACTATTTCTA 
TTACATTGTT 
CCTAGACCCT 
TACGGATGTT 
CACTTATTGG 
ATTAAACATC 
ATATAAAGTG 



CGTTGTGATA 
CGAGGCCCCG 
CAAGGTGGGC 
AT C AT AAAT C 
AACAAACACA 
ATCATGGTAG 
AATTAAATTA 
CTTTCTTAAT 
CTTCACTGTA 
TACACTTGAT 
ACTTGATATT 
TTTATAACAT 
GAACTTGAGC 
AACTCAGGAT 
ATCTTGCTAG 
CATTCACAGC 
CTAGGTGCTA 
TTCTATAGAT 
TTCTTCACTC 
GATTTTGTGC 
GAATGTTTTT 
AGTCCCAAAT 
ATAATGATTA 



TGTAGATGAT 
TGCTCTGGTA 
TCATCTGAGC 
TTCAAAGTGA 
AATTATCTAT 
ATAACAGTAA 
AGAAGGTGTG 
CCTTTATAAT 
ATGAAGAAAT 
TCCGTGTGTT 
ATTACAAGGT 
ACTTTATTTA 
AGAT7AAGAGG 
CAAATTCAGG 
CCCTGAACAT 
ATGCATGGTG 
GCTAGTACTG 
CCTTGATACT 
TTATTAAATA 
AGTTCTACTA 
GTTAATAAAA 
TACAAACTTC 
TCATCTACAT 



AATCATTATC 
ATAGATCTTT 
TGCAGATTCA 
CACTCATTGC 
TATATAAAGG 
GTTGAGAGCT 
AAT AT AC T AC 
GTTTTCTGCT 
GTAGTAAATA 
ACTTATCATT 
ACATATTTAT 
TTCATTTCTT 
ACAAATTGCA 
TTGTCAGGCT 
CAAGCTTTGC 
ATTTATTGTT 
TATTTCGAGA 
CTTTCAGCAG 
CTAAGTACAA 
TGATAATAGT 
GATTCAGGTG 
AATAAAAGAT 
ATCACAACGT 



ACTTTACGGG 
GCTGAAAAGG 
GCTTAATATG 
CAGACACAGG 
GTGTTAGAAG 
TAAATTTAAT 
AGTAGGTAAA 
ATTGTCAATT 
TACTTTCCGA 
TTATTATTAT 
TCTCTCAGAT 
ATGTGTGCTG 
AGAGTCAGTT 
TGGCAGCATG 
AGACAGACAG 
ACTTTCTATT 
TAG7AAGTTAC 
ATATAGAGTT 
ATTAAGTTTA 
CTAGCTTCAT 
TTACTCTAGG 
TTGACTCTCC 
GCGTGGAGG 



TCCTTTCACT 
CACACACATG 
AATCTTGCCA 
TGCCCACCTT 
ATGCTTTAGA 
AAAGTGATAT 
TTATTTCATT 
GCACATCCAT 
ACAAGTTGTA 
ATTGATTGCA 
CTTCATTATA 
TGAGGCACAA 
ACCTCCTGCT 
CACTTTTTAC 
GCTACACTAA 
CCATGCCTTT 
TGAAAGAAAA 
TTAATCAGGT 
TCCAAAACAG 
AAATCTGACA 
TCAAGAGAAT 
AGTGGTGGCA 



.12 0 
18 0 
240 
300 
360 
420 
4 8 0 
b4 0 
6 0 0 
6 6 0 
72 0 
780 
84 0 
9 0 0 
!>60 
1020 
1080 
114 0 
1200 

1 2 6 0 

1 3 2 0 
1>>69 



(2) INFORMATION FOR SEQ ID NO: 16: 



!i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22118 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 i nea r 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL : NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

GAATTCCCCT ATCCCTAATC CAGATTGGTG GAATAACTTG GTATAGATGT TTGTGCATTA 6 0 

AAAACCCTGT AGGATCTTCA CTCTAGGTCA CTGTTCAGCA CTGGAACCTG AATTGTGGCC 12 0 

CTGAGTGATA GGTCCTGGGA CATATGCAGT TCTGCACAGA CAGACAGACA GACAGACAGA 180 

CAGACAGACA GACAGACGTT ACAAACAAAC ACGTTGAGCC GTGTGCCAAC ACACACACAA 24 0 

ACACCACTCT GGCCATAATT ATTGAGGACG TTGATTTATT ATTCTGTGTT TGTGAGTCTG 30 0 

TCTGTCTGTC TGTCTGTCTG TCTGTCTGTC TATCAAACCA AAAGAAACCA AACAATTATG 3 60 

CCTGCCTGCC TGCCTGCCTG CCTACACAGA GAAATGATTT CTTCAATCAA TCTAAAACGA 42 0 

CCTCCTAAGT TTGCCTTTTT TCTCTTTCTT TATCTTTTTC TTTTTTCTTT TCTTCTTCCT 4 80 

TCCTTCCTTC CTTCCTTCCT TCCTTCCTTT CTTTCTTTCT TTCTTTCTTT CTTACTTTCT 54 0 

TTCTTTCCTT CTTACATTTA TTCTTTTCAT ACATAGTTTC TTAGTGTAAG CATCCCTGAC 600 

TGTCTTGAAG ACACTTTGTA GGCCTCAATC CTGTAAGAGC CTTCCTCTGC TTTTCAAATG 66 0 

CTGGCATGAA TGTTGTACCT CACTATGACC AGCTTAGTCT TCAAGTCTGA GTTACTGGAA 72 0 

AGGAGTTCCA AGAAGACTGG TTATATTTTT CATTTATTAT TGCATTTTAA TTAAAATTTA 7 80 

ATTTCACCAA AAGAATTTAG ACTGACCAAT TCAGAGTCTG CCGTTTAAAA GCATAAGGAA 84 0 

AAAGTAGGAG AAAAACGTGA GGCTGTCTGT GGATGGTCGA GGCTGCTTTA GGGAGCCTCG 90 0 

TCACCATTCT GCACTTGCAA ACCGGGCCAC TAGAACCCGG TGAAGGGAGA AACCAAAGCG 96 0 

AC CTGGAAAC AATAGGTCAC ATGAAGGCCA GCCACCTCCA TCTTGTTGTG CGGGAGTTCA 102 0 

GTTAGCAGAC AAGATGGCTG CCATGCACAT GTTGTCTTTC AGCTTGGTGA GGTCAAAGTA 10 80 

CAACCGAGTC ACAGAACAAG GAAGTATACA CAGTGAGTTC CAGGTCAGCC AGAGTTTACA 114 0 

CAGAGAAACC ACATCTTGAA AAAAACAAAA AAATAAATTA AATAAATATA ATTTAAAAAT 12 0 0 

TTAAAAATAG CCGGGAGTGA TGGCGCATGT CTTTAATCCC AGCTCTCTTC AGGCAGAGAT 12 6i; 

GGGAGGATTT CTGAGTTTGA GGCCAGCCTG GTCTGCAAAG TGAGTTCCAG GACAGTCAGG 13 2 0 

GCTATACAGA GAAACCCTGT CTTGAAAACT AAACTAAATT AAACTAAACT AAACTAAAAA 13 8 0 

AATATAAAAT AAAAATTTTA AAGAATTTTA AAAAACTACA GAAATCAAAC ATAAGCCCAC 14 4 0 

GAGATGGCAA GTAACTGCAA TCATAGCAGA AATATTATAC ACACACACAC AC AC AG AC T C 1500 

TGTCATAAAA TCCAATGTGC CTTCATGATG ATCAAATTTC GATAGTCAGT AATACTAGAA 15o0 

GAATCATATG TCTGAAAATA AAAGCCAGAA CCTTTTCTGC TTTTGTTTTC TTTTGCCCCA 162 0 

AGATAGGGTT TCTCTCAGTG TATCCCTGGC ATCCCTGCCT GGAACTTCCT TTGTAGGTTT 16 8 0 

GGTAGCCTCA AACTCAGAGA GGTCCTCTCT GCCTGCCTGC CTGCCTGCCT GCCTGCCTGC 174 0 

CTGCCTGCCT GCCTGCCTCA CTTCTTCTGC CACCCACACA ACCGAGTCGA AC C T AGGATC 1800 

TTTATTTCTT TCTCTTTCTC TCTTCTTTCT TTCTTTCTTT CTTTCTTTCT TTCTTTCTTT 1860 

CTTTCTTTCT TTCTTATTCA ATTAGTTTTC AATGTAAGTG TGTGTTTGTG CTCTATCTGC 192 0 

TGCCTATAGG CCTGCTTGCC AGGAGAGGGC AACAGAACCT AGGAGAAACC ACCATGCAGC 19 80 

TCCTGAGAAT AAGTGAAAAA ACAACAAAAA AAGGAAATTC TAATCACATA GAATGTAGAT 2 04 0 

ATATGCCGAG GCTGTCAGAG TGCTTTTTAA GGCTTAGTGT AAGTAATGAA AATTGTTGTG 2100 

TGTCTTTTAT C C AAAC AC AG AAGAGAGGTG GCTCGGCCTG CATGTCTGTT GTCTGCATGT 2160 

AGACCAGGCT GGCCTTGAAC ACATTAATCT GTCTGCCTCT GCTTCCCTAA TGCTGCGATT 2 22 0 

AAAGGCATGT GCCACCACTG CCCGGACTGA TTTCTTCTTT TTTTTTTTTT TGGAAAATAC 2 2 80 

CTTTCTTTCT TTTTCTCTCT CTCTTTCTTC CTTCCTTCCT TTCTTTCTAT TCTTTTTTTC 2 34 0 

TTTCTTTTTT CTTTTTTTTT TTTTTTTTAA AATTTGCCTA AGGTTAAAGG TGTGCTCCAC 24 0 0 

AATTGCCTCA GCTCTGCTCT AATTCTCTTT AAAAAAAAAC AAACAAAAAA AAAACCAAAA 24 60 

CAGTATGTAT GTATGTATAT TTAGAAGAAA TACTAATCCA TTAATAACTC TTTTTTCCTA 2 52 0 

AAATTCATGT CATTCTTGTT CCACAAAGTG AGTTCCAGGA CTTACCAGAG AAACCCTGTG 2580 

TTCAAATTTC TGTGTTCAAG GTCACCCTGG CTTACAAAGT GAGTTCCAAG TCCGATAGGG 2 64 0 

CTACACAGAA AAAC CAT AT C TCAGAAAAAA AAAAAGTTCC AAACACACAC ACACACACAC 2 70 0 

ACACACACAC ACACACACAC ACACACACAC ACACACACAG CGCGCCGCGG CGATGAGGGG 2 76 0 

AAGTCGTGCC TAAAATAAAT ATTTTTCTGG CCAAAGTGAA AGCAAATCAC TATGAAGAGG 2 82 0 

TACTCCTAGA AAAAATAAAT ACAAACGGGC TTTTTAATCA TTCCAGCACT GTTTTAATTT 2880 

AACTCTGAAT TTAGTCTTGG AAAAGGGGGC GGGTGTGGGT GAGTGAGGGC GAGCGAGCAG 2 94 0 

ACGGGCGGGC GGGCGGGTGA GTGGCCGGCG GCGGTGGCAG CGAGCACCAG AAAACAACAA 3000 

ACCCCAAGCG GTAGAGTGTT TTAAAAATGA GACCTAAATG TGGTGGAACG GAGGTCGCCG 3060 

CCACCCTCCT CTTCCACTGC TTAGATGCTC CCTTCCCCTT ACTGTGCTCC CTTCCCCTAA 312 0 

CTGTGCCTAA CTGTGCCTGT TCCCTCACCC CGCTGATTCG CCAGCGACGT ACTTTGACTT 318 0 
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CAAGAACGAT TTTGCCTGTT TTCACCGCTC CCTGTCATAP TTTCGTTTTT OTTrr. B 

GTCTAGCCCG TTCGCTATGT TCGGGCGGGA CGATGGGGAi cS5rGTCpJ $5^£i 
AGTGGTGGGT GGGTACGCTG CTCCGTCGTG CGTGCGTGAG TGCCGGAAGC Salvrr 

Sg^GtSg G?™cS AC GcSracG GATCTGTA TT 

Zt„ GTT^ATCGAG ACCATTGTCG GGCGACACCT AGTGGTGAPA flPTTTrr/v a 

ACGCTCCAGG CCTCTCAGGT TGGTGACACA GGAGAGGGAA GTto" t^--P 
MGGGTGACAG GAGGCCGGGC AAGCAGGCGG GAGCGTCTCG GAGATGCTCT JSm^r 
GACGGTCTCT AACAAGGAGG TCGTACAGGG AGATGGCCAA AGCAgScCgI GTTGCTPTAG 
GCCCTTTTGG GAAAAATGCT AGGGTTGGTG GCAACGTTAC TAGGtS^AcJ AgSpp^TA 
AGTCCTACCC CCCCCCCCCT TTTTTTTTTT TTTCCTCCAG AAGCCCTCTP 
? AGCGGGGGC ACCGTACATC TGAGGCCGAG AGGACGcSS G^GCCCGgS tcS^GCCPP 
^^^o TCGG CCA °CTGGCG CTTCGGGTCT TTTTTTTTTT TTTTTTTtS 
GAAGCCTTGT CTGTCGCTGT CACCGGGGGC GCTGTACTTC TGAGGCCGAG Irrlrrrn^ 
^^ GGC TTCCAA GCCG GTGTGGCTCG GCCAGCTGGA GCTTCGGGTC 
™™ I™TCTC CAGAA GCCTT GTCTGTCGCT GTCACCGGGG GCgSgSS 

SSSS ?SSSS SSSS SSSgg? SSSSsi 

TACTTCTGAG GCCGAGAGGA CGTGATGGGC CCGGGTTCCA GGCGGmStc G^pSSpIp 
P^p^o GGATCTTT TT TTTTTTTTTT CCTCCAGAAG SScTCTTC ?CCCCCTCAC 
r^™ C TTACATCT GA GGGCGAGAGG ACGTGATGGG TCCGGCTTCC 
GGCGGGGCCA GCTGGAGCTT CGGGTTTTTT TTTTTTCCTC CAGAAGCCCT ^TTCTCrr 

| sssss sssss ssss ssssi il 

= 2= SSSSSS SS5SSS = 

I™ illi S™ ™s ssss S 

iilnAAA^H TTCCTCCAGA AACCTTGTCT GTCGCTGTCA CCCGGGGCGC TTGTAPTTPT 
GATGCCGAGA GGACGCGATG GGCCCGTCTT CCAGGCCGAT GTGGCCCGGT CAG?tppIpp 
TTTGGATCTT TTTTTTTTTT TTTTCCTCCA GAAGCCCTCT CTTGtcSS TcIcIgCCCC 
GAG ™ CAT CTGAGGCCTA GAGGACACGA TGGGCCCGGG TTCCAGGCCG A^CTgSppp 
GTCAGCTGGA GCTTTGGATC TTTTTTTTTT TTTTPTTPPa ATGTGGCCCG 

Grc^r? ctgtacat ^ olllllllll ssss sssss sssss 

G I GGGGGGGT CAGCTGGAGC TTTGGATCTT ATTTTTTTTT TAATTTTTTC TtSgSpp 
^™ TCC ctgtcaccgg TGGCACGGTA CATCTGAGGC CGAGAGGACA SSSgg^pp 
tttttcctcc S? S ccggtcagg ? ggagctttgg atc?™? 

TXTTTCCTCC AGAAGCCCTC TCTGTCCCTG TCACCGGGGG CCCTGTACGT CTCArrrrrt 
GGGAAAGCTA TGGGCGCGGT TTTCTTTCAT TGACCTGTCG GTCTTmSS JScccrPT 
TGTCAGGGTC GACCAGTTGT TCCTTTGAGG TCCGGTTCTT TTCGTtSSS GOTcSSt 
GG ^ GGAGCTC CCCAGGTATG ACTTCCAGGC GTCGTTGCTC GCCTGtSIct TOCT^PTP 

AT GCTTGTGATC ttttctatct gttcctattg gacctggJS SggSctpI 
GAGGGTGTCC tttccctatt aacactaaag gacactataa agagaccct? tcStSp 

TGC TTGTCCAGCC ipATTCTTTTT ACTGGCTTGG GTCTGTcSS CTcSSpa^p 
^ G ^ GGGGGA GCCACGCTTC CTGCTTTCCC GGGCTTGCTG CTTGCGTGTC CTTCCtSSp 
GCAGCTTGTG ACAACTGGGC GCTGTGACTT TGCTGCGTGT CAGACGTTTT SccSSSn 
CCCGAGGTGT CGTTGTCACA CCTGTCCCGG TTGGAATGGT GGAGCCAgS IrcrTTrlrr 
GCCACCTTAT TTCGGCTCAC TTTTTTTTTT TTTTTTTCTC TTGGAGTCCP pIapp^^ 
?SSS" TCCCGGTCTT TCTTCCACAT GCCTCcSS SSSSS? T^ot™ 
^TTTTTT TTTTTTTTTT TTGGGGAGGT GGAGAGTCCC GAGTACTTcI SotgSp 

™ GTCCAA ctgttcatgc cacgtgcctc ccgagtgcac ttttttttgt ggSctpppt 

liiil SSSJSS SSS? ™ |i 

|= sssss? ssssss = gssss li i 

f if? ££££ 0 =S ssssss = 

tcacgtgttt cactttggtc gtgtctcgct tgaccatgtt cccagagtcg ctc7a£p£pp 
ccggtggcgt tgcataccct tcccgtctgg tgtgtgcacg cgctgtt?c? ?g?Sgpp?p 
GAGGTGCTCC tggagcgttc caggtttgtc tcctaggtgc ctgctSga Ict^SgcIc 
tptptp^ CA TTCCCTGGTG tgcctccggt gctccgtctg gctgtgtSc? ?tcc?g??tc 
^J^ GAGAA gcccgt gaga ggggggtcga ggagagaagg AGGGGCAAGA CCCCCCTTPT 

TCGTCGGGTG AGGCGCCCAC CCCGCGACTA GTACGCCTGT GCGTAGGGC? GGTCrrrlrr 
GGTCGCGGCT GGGGTTGGAA AGTTTCTCGA GAGACTCATT GCTTTCCCGT GGGGaJptt? 

GAGAGGCCTG gctttcgggg GGGACCGGTT GCAGGGTCTC ccctctccgc gg???Spap 
aatgcccttg gaagagaacc ttcctgttgc cgcagacccc ccScgSg? cg^cgJgtc 



3.^4 0 
"■! '.0 0 

-60 

M20 
;4 80 
3 : ,4 0 

:jooo 

!"i b '5 0 
3 7 CO 

: s 7 a o 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

43 8 0 

4440 

4 5 00 

4560 

4 62 0 

4 6 8 0 

4 74 0 

■1800 

4860 

4 92 0 

4 98 0 

-.04 , 

MO ■ 

5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

64 8 0 

654U 

5600 

6660 

6720 

6780 

6840 

6900 
6960 
7020 
7080 
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TTGGTCTTCT 
GTCGGGGTTT 
GGAAAGGGTG 
CTCGCCGCCT 
GGCCTCCCGG 
CCGTTGCTGC 
GCACACCCCC 
TGGGTAGGCG 
TCCGTCGCGT 
CTGCGCCGCG 
CCCCCCTTCC 
CCTCGGGGTC 
GTTCTGTGGG 
GCCGCTCGGG 
CGGTGTCGCC 
GGTGTGGTGG 
TGCCCTGACC 
GAGGGGCCCG 
CCCCCTCCCC 
ACCCGTGGCC 
CGGTCACCGG 
GAGCTGTGGT 
GAGAGGGCTG 
AGTGGTCATT 
CCGGCCCTGT 
ACCCTGGCGG 
GATGTCTACC 
CCTCGTTCCT 
CATCTCTCGC 
TCGCCGGGGG 
CTCGCCGGCT 
GACGTTGCGC 
GAGCCCCTGC 
TGTGTCGCGT 
GACGGGTGGC 
TCGTTGGTGT 
TCGCCGGTGT 
CGGCCCGGTG 
GGGACGGAGG 
GTTGGCTTTG 
TCCGGCCGCA 
CCTCCCGCGA 
CCTGGTCCTG 
GGTAGCATAT 
AGTGAAACTG 
CTACTTGGAT 
TCCCGGGGGG 
CTCCGGCCGG 
ACGCCCCCCG 
TCGCCGTGCC 
GAGCCTGAGA 
CGACCCGGGG 
GGAATGAGTC 
CAGCCGCGGT 
TAGTTGGATC 
CCCCTTGCCT 
GTTTACTTTG 
AGGAATAATG 
TAAGAGGGAC 
GCAAGACGGA 
TCGGAGGTTC 
CGATGCGGCG 
GGTTCCGGGG 
CCAGGAGTGG 
CGGACAGGAT 



GGTTTCCCTG 
TGGGTCCGTC 
CGGGCTTCTT 
GACCGCCTCC 
CTCCGAGTTC 
GGAGCATGTG 
GCGTGCGCGT 
ACGGTGGGCT 
GCGTCCCTCT 
CGTGGTGCGT 
CGCGGCAGCG 
GAGAGGGTCC 
AGAACGGCTG 
GGTCTTCGTC 
TCCTCGGGCT 
GACTGCTCAG 
GGTCCGACGC 
TTTCGGCCGC 
GCTCGCCGCA 
GTGCTGTCGG 
GGTCTTGGGG 
TTGGAGGGCG 
CGTGCGAGGG 
GTCCCGACGG 
CGTCCGTCGG 
TGGGATTAAC 
TCCCTCTCCC 
CCCTCTCGCG 
GCAATGGCGC 
CTGGCCGCTG 
TCGCGGACTC 
CTCGCTGCTG 
CGCACCCGCC 
CGGGAGCGTG 
CTATCCAGGG 
GGGGAGTGAA 
CGCGCTTCTC 
CGGTCGACGT 
GGAGAGCGGG 
CCGCGTGCGT 
TGCACTCTCC 
GGCTCTCCGC 
TCCCACCCCC 
GCTTGTCTCA 
CGAATGGCTC 
AACTGTGGTA 
GGATGCGTGC 
GGGTCGGGCG 
TGGCGGCGAC 
TACCATGGTG 
AACGGCTACC 
AGGTAGTGAC 
CACTTTAAAT 
AATTCCAGCT 
TTGGGAGCGG 
CTCGGCGCCC 
AAAAAATTAG 
GAATAGGACC 
GGCCGGGGGC 
CCAGAGCGAA 
GAAGACGATC 
GCGTTATTCC 
GGAGTATGGT 
GCCTGCGGCT 
TGACAGATTG 



TGTGCTCGTC 
CCGCCCTCAG 
ACGGTCTCGA 
CGCGCGCGCA 
GGGGAGGGAT 
GCTCGGCTTG 
ACTTTCCTCC 
CCCGGGTCCC 
CGCTCGCGTC 
GGTGTGTGCT 
TTGCCACGGC 
GTGTCTGGCG 
TTGGCCGCGT 
GGTAGGCATC 
CCCGGGGGGC 
GGGAGTGGTG 
CCGAGCGGTC 
CCTTGCCGTC 
GCCGGTCTTT 
ACCCCCCGCA 
GGGGGCCGAG 
TCCCGGCCCC 
GAAAAGGTTG 
TGTGGTGGTC 
GAAGGCGCGT 
CCCGCGCGCG 
CGAGGTCTCA 
GGGTTCAAGT 
CGCCCGAGTT 
TCCGGTCTCT 
CTGGCTTCGC 
TGTGCTTGGG 
GGTGTGCGGT 
TCCGCCTCGC 
CTCGCCCCCG 
TGGTGCTACC 
TTTCCGCCAA 
TCCGGCTCTC 
TAAGAGAGGT 
GTGCTCGCGG 
CGTTCCGCGC 
CGCCGCCGCC 
GACGCTCCGC 
AAGATTAAGC 
ATTAAATCAG 
ATTCTAGAGC 
ATTTATCAGA 
CCGGCGGCTT 
GACCCATTCG 
ACCACGGGTG 
ACATCCAAGG 
GAAAAATAAC 
CCTTTAACGA 
CCAATAGCGT 
GCGGGCGGTC 
CCTCGATGCT 
AGTGTTCAAA 
GCGGTTCTAT 
ATTCGTATTG 
AGCATTTGCC 
AGATACCGTC 
CATGACCCGC 
TGCAAAGCTG 
TAATTTGACT 
ATAGCTCTTT 



GOATGCATCC 
TGAGAAAGTT 
GGGGTCTOTG 
GCGTTTGCTC 
CACGCGGGGC 
TGTGGTTGGT 
CCTCCTGAGG 
CACCCGTCTT 
CACGACTTTG 
TCTCGGGCTG 
TGGCGAAATC 
TTGATTGATC 
CCGGCGCGAC 
GGTGTGTCGG 
CGTCGTGTTT 
CAGTGTGATT 
TCTCGGTCCC 
GTCGCCGGCC 
TTTCCTCTCT 
TGGGGGCGGC 
GGGTAAGAAA 
GCGGCCGTGG 
CCCCGCGAGG 
TGTTGGCCGA 
GTTGGGGCCT 
TGTCCCGGTG 
GGCCTTCTCC 
CGCTCGTCGA 
CACGGTGGGT 
CCTGCCCGAC 
CCGGAGGGTC 
GGGGGCCCGC 
TTCGCGCCGC 
GGCGGCTAGA 
CCGACCCCCG 
GGTCATTCCC 
CCCCCACGCC 
CCGATGCCGA 
GTCGGAGAGC 
ACGGGTTTTG 
GAGCGCCCGC 
TCCTCCTCCT 
TCGCGCTTCC 
CATGCATGTC 
TTATGGTTCC 
TAATACATGC 
TCAAAACCAA 
GGTGACTCTA 
AACGTCTGCC 
ACGGGGAATC 
AAGGCAGCAG 
AATACAGGAC 
GGATCCATTG 
ATATTAAAGT 
CGCCGCGAGG 
CTTAGCTGAG 
GCAGGCCCGA 
TTTGTTGGTT 
CGCCGCTAGA 
AAGAATGTTT 
GTAGTTCCGA 
CGGGCAGCTT 
AAACTTAAAG 
CAACACGGGA 
CTCGATTCCG 



TGTCTCGGTG 
TCCTTCTCTA 
CCGAATGGTG 
TCTCGTCTAC 
AGAGCCTGTC 
GGCTGGGGAG 
GCCGCCGTGC 
CCCGTGCCTC 
GCCGCTCCCG 
TGTGGTTGTG 
GCGGGAGTCC 
TCGCTCTCGG 
GTCGGACGTG 
CATCGGTCTC 
CGGGTCGGCT 
CCCGCCGGTT 
TTGTGAGGAC 
CTCGTTCTGC 
CCCCCCCTCT 
CGGGCACGTA 
GTCGGCTCGG 
CGGTGTGTTG 
GCAAAGGGAA 
GGTGCGTCTG 
GCCGGAGTGC 
TGGCGGTGGG 
GCGCGGGCTC 
CCTCCCCTCC 
TCGTCCTCCG 
CCCCGTTGGC 
AGGGGGCTTC 
TGCGGCCTCC 
GGTCAGTTGG 
CGCGGGTGTC 
CCTGCCCGTC 
TCCCGCGTGG 
AACCCACCAC 
GGGGTTCGGG 
TGTCCCGGGG 
TCGGACCCCG 
CCGGCTCACC 
CTCTCGCGCT 
TTACCTGGTT 
TAAGTACGCA 
TTTGGTCGCT 
CGACGGGCGC 
CCCGGTGAGC 
GATAACCTCG 
CTATCAACTT 
AGGGTT C GAT 
GCGCGCAAAT 
TCTTTCGAGG 
GAGGGCAAGT 
TGCTGCAGTT 
CGAGTCACCG 
TGTCCCGCGG 
GCCGCCTGGA 
TTCGGAACTG 
GGTGAAATTC 
TCATTAATCA 
CCATAAACGA 
CCGGGAAACC 
GAATTGACGG 
AACCTCACCC 
TGGGTGGTGG 
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TTCTTAGTTG 
CTAACTAGTT 
GCGTTCAGCC 
TGCACGCGCG 
AACCCGTTGA 
GAATTCCCAG 
ACCGCCCGTG 
GGTCGGCCCA 
AGTAAAAGTG 
CTGTGGAGGA 
CGCGTGCGTC 
GAAGGGGTGG 
TCCCCTCTCG 
GCGTCTTGCC 
GGTTTTTGAC 
CCCATCCCCG 
GGATGTGAGT 
GTCCTCCCCG 
CCCCGGGGGG 
CGGTCGTTCG 
CCCGAGGCGG 
CCCGACCCGC 
GGGTTCCCGT 
CACGTGTCTC 
CCTCTCTCTC 
CGTGAGTTCG 
TGCGTCGATG 
CATCGACACT 
CGTCGGTTGA 
CTCGCAGGGC 
GGGCGGTTGT 
CGCGCTCGCG 
GCCTCGCGTC 
TGGGAACCCA 
GAGGTTGGCG 
GGTTGTCGGG 
GTTTGGGTCT 
GGCGCCGCGC 
GTATCCCCGG 
CCTCGGTGGG 
CGTGGCTCTT 
CCGCGGGACG 
GGGAGGGAGA 
CTGTGGGCTG 
CCCTCCCGCC 
GCCGGGTGCC 
TGTCCCCCCT 
ATTAGTCAGC 
GAAGAGCCCA 
GACCCACTCC 
TGGACGGTGT 
GTTGCTTGGG 
CGAGACCGAT 
TCAAGAGGGC 
GATTCAACCC 
CCCCGTTCCT 
GCCTCCGGCG 
GGGTCGGCGG 
GGCGGTGCGC 
GGGGGGGCGG 
GGCCGCGCTT 
CTCTCCCCCC 
GGCGCGACCG 
CGGACTGTCC 
GTCACGCGTC 



GTGGAGCGAT 
AGGCGACCCG 
ACCCGAGATT 
CTACACTGAC 
ACCCCATTCG 
TAAGTGCGGG 
GCTACTACCG 
CGGCCCTGGC 
GTAACAAGGT 
GCGGCGGCGT 
CCGGGTCCCG 
GTGGGGTCGG 
CTCGTCCGGC 
TCTTTCCCGT 
CCGTCCCGGG 
CCGCGGCTCT 
GTCGCGTGTG 
CTCCTGTCCC 
GTCGCCCTGC 
GGCGGCTCTC 
CGGTCGTGTG 
GCCGCCGGCT 
GTCGTTCCCG 
GTTTCGTTCC 
CGGGGAGAGG 
CTCACACCCG 
AAGAACGCAG 
TCGAACGCAC 
CGATCAATCG 
CAACCCCCCA 
CGGTGTGGCG 
GCTTCTTCCC 
GGCGCCTCCC 
CCGCGCCCCC 
GTTGAGGGTG 
GTGGCGGTCG 
TGCGCTGGGG 
ACCCTCCGGC 
TGGCGTTGCG 
CGCCTTCGCG 
CTTCGTCTCC 
CCGCGGCGTC 
GGGCCTCGCT 
TGCGTCCCGG 
GGCCTCTCGG 
GTCTCTTTCC 
TTCTGACCGC 
GGAGGAAAAG 
GCGCCGAATC 
CCGGCGCCGC 
GAGGCCGGTA 
AATGCAGCCC 
AGTCAACAAG 
GTGAAACCGT 
GGCGGCGCGC 
CCCGACCCCT 
GCGGGCGCGG 
GGGACCGCCC 
CGCGACCGGC 
CGCGTCTCAG 
TCGCCGAATC 
GTCCGCCTCC 
CTCTCCCACC 
CCAGTGCGCC 
TCCCGACGAA 



TTGTCTGGTT 
CGAGCGGTCG 
GAGCAATAAC 
TGGCTCAGCG 
TGATGGGGAT 
TCATAAGCTT 
ATTGGATGGT 
GGAGCGCTGA 
TTCCGTAGGT 
GGCCCGCTCT 
TGGCGCGCGT 
TCTGGGTCCG 
TCTGACCTCG 
CCGGCTCTTC 
GGGGTTCGGT 
GGCTTTTCTA 
GGCTCGCCCG 
GGGTACCTAG 
CGCCCCCAGG 
CCTCAGACTC 
GGGGGGTGGA 
TGCCCGATTT 
TGTTTTTCCG 
TGCTGGCCGG 
AGGGCGGTGG 
AAATACCGAT 
CTAGCTGCGA 
TTGCGGCCCC 
CGTCACCCGC 
ACCCGGGTCG 
CGCGCGCCCG 
GCTCCGCCGT 
GGACCGCTGC 
GTGGCGCCCG 
TGCGTGCGCC 
ACGAGGGCCG 
GAGGCGGGGT 
TTGTGTGGAG 
AGGGAGGGTT 
CCGCACGCGG 
GCTTCTCCTT 
CGTGCGCCGA 
GACCCGTTGC 
GGGTTGCGTG 
GGACCCCCTG 
CGCCCGCCTC 
GACCTCAGAT 
AAACTAACCA 
CCCGCCGCGC 
TCGTGGGGGG 
GCGGCCCCGG 
AAAGCGGGTG 
TACCGTAAGG 
TAAGAGGTAA 
GTCCGGCCGT 
CCACCCGCGC 
GGGGTGGTGT 
CCGGCCGGCG 
TCCGGGACGG 
GGCGCGCCGA 
CCGGGGCCGA 
CGGGCGGGCG 
CCCCTCCGTC 
CCGGGCGTCG 
GCCGAGCGCA 



AATTCCGATA 
GCGTCOCCCA 
AGGTCTGTGA 
TGTGCCTACC 
CGGGGATTGG 
GCGTTGATTA 
TTAGTGAGGC 
GAAGACGGTC 
GAACCTGCGG 
CCCCGTCTTG 
GTGGAGCGAG 
TCTGGGACCG 
CCACCCTACG 
CGTGTCTACG 
CGTCGGGGCG 
CGTTGGCTGG 
TCCGGATGCC 
CTGTCGCGTT 
GTCGGGGGGC 
CATGACCCTG 
TGTCTGGAGC 
CCGCGGGTCG 
CTCCCGACCC 
CCTGAGGCTA 
TCGTTGGGGG 
ACGACTCTTA 
GAATTAATGT 
GGGTTCCTCC 
TGCGGTGGGT 
GGCCCTCCGT 
CGTCGCGGAG 
TGCCGCCCTC 
CTCACCAGTC 
GGGGTGGGCG 
GAGGTGGTGG 
GTCGGTCGCC 
CGACCGCTCG 
GGAGAG CGAG 
TGGCGTCCCG 
CCGCTAGGGG 
CACCCGGGCG 
TGCGAGTCAC 
GTCCCGGCTT 
TGAGTAAGAT 
AGACGGTTCG 
CTCGCTCTCT 
CAGACGTGGC 
GGATTCCCTC 
GTCGCGGCGT 
CCCAAGTCCT 
CGCGCCGGGC 
GTAAACTCCA 
GAAAGTTGAA 
ACGGGTGGGG 
GCCCGGTGGT 
GTCGTTCCCC 
GGTGGTGGCG 
ACCGGCCGCC 
CCGGGAAGGC 
ACCACCTCAC 
GGAAGCCAGA 
TGGGGGTGGG 
GCCTCTCTCG 
TCGCGCCGTC 
CGGGGTCGGC 



ACGAACGAGA 
ACTTCTTAGA 
TGCCCTTAGA 
CTGCGCCGGC 
AATTATTCCC 
AGTCCCTGCC 
CCTCGGATCG 
GAACTTGACT 
AAGGATCATT 
TGTGTGTCCT 
GTGTCTGGAG 
CCTGCGATTT 
GCGGCGGCGG 
AGGGGCGGTA 
CGCGCTTTGC 
GGCGGTTGTC 
ACGCTTTTCT 
CCGGCGCGGA 
GGTGGGGCCC 
CTCCCCCCGC 
CCCCTCGGGC 
GTCGTGTCGG 
TTTTTTTTTC 
CCCCTCGGTC 
ACTGTGCCGT 
GCGGTGGATC 
GAATTGCAGG 
CGGGGCTACG 
GCTGCGCGGC 
CTCCCGAAGT 
CCTGGTCTCC 
GCCGGTGCAC 
TTTCTCGGTC 
CGTCCGCATC 
TCGGTCCCCT 
TGCGGTGGTT 
CGGGGTTGGC 
GGCGAGAACG 
CGTCCGTCCG 
CGGTCGGGGC 
GTACCCGCTC 
CCCCGGGTGT 
CCCTGGGGGG 
CCTCCACCCC 
CCGGCTCGTC 
TCTTCCCGCG 
GACCCGCTGA 
AGTAACGGCG 
GGG AAATGTG 
TCTGATCGAG 
TCGGGTCTTC 
TCTAAGGCTA 
AAGAACTTTG 
TCCGCGCAGT 
CCCGGCGGAT 
TCTTCCTCCC 
CGCGGGCGGG 
GCCGGGCGCA 
CCGGTGGGGA 
CCCGAGTGTT 
TACCCGTCGC 
GGCCGGGCCG 
GGGCCCGGTG 
GGGTCCCGGG 
GGCGATGTCG 



GTCTGGCATG 
GGGAGAAGTG 
TGTGCGGGGG 
AGGGGCGGGT 
CATGAACGAG 
CTTTGTACAC 
GCCCCGCCGG 
ATCTAGAGGA 
AAACGGGAGA 
CGCCGGGAGG 
TGAGGTGAGA 
CCCCTCCCCC 
CTGCTCGCGG 
CGTCGTTACG 
TCTCCCGGCA 
GCGTGTGGGG 
GGCCTCGCGT 
GGTTTAAGGA 
GTAGGGAAGT 
TGCCGCCGTT 
GCCGTGGGGG 
TGCCGGTCGT 
CTCCCCCCCA 
CATCTGTTCT 
CGTCAGCACC 
ACTCGGGTCG 
ACACATTGAT 
CCTGTCTGAG 
TGGGAGTTTG 
TCAGACGTGT 
CCCGCGCATC 
CCCGGTCCTG 
CCGTGCCCCG 
TGCTCTGGTC 
GCGGCCGCGG 
GTCTGTGTGT 
GCGGTCGCCC 
GAGAGAGGTG 
TCCCTCCCTC 
CCGTGGCCCC 
CGGCGCCGGG 
TGCGAGTTCG 
GACCCGGCGT 
CGCCGCCCTC 
CTCCCGTGCC 
GCTGGGCGCG 
ATTTAAGCAT 
AGTGAACAGG 
GCGTACGGAA 
GCCCAGCCCG 
CCGGAGTCGG 
AATACCGGCA 
AAGAGAGAGT 
CCGCCCGGAG 
CTTTCCCGCT 
CGCGTCCGGC 
GCCGGGGGTG 
CTTCCACCGT 
AGGTGGCTCG 
ACAGCCCTCC 
CGCGCTCTCC 
CCCCTCCCAC 
GGGGGCGGGG 
GGGACCGTCG 
GCTACCCACC 
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CGACCCGTCT 
GAAAGCCGCC 
CGAGGCCTCT 
AGGTGGAGCA 
CGAAGGCAGA 
CGACCTGGGT 
TTTCCCTCAG 
AATGATTAGA 
AGAAGCCCGG 
TTGGTAAGCA 
GACGCTCATC 
GAAGTCGGAA 
AATGGATGGC 
GGACGGGAGC 
CCCCCGCCTC 
CGCCGCGACG 
TGGAGCCGCC 
AGGCCGAAGT 
AGAGATGGGC 
CGAAAGGGAG 
CAGTGCGGTA 
TTTCTTTGTG 
TTGGAAAGCG 
GGGGAGAGGG 
GAACAGCCTC 
ACTTCGGGAT 
CTGGGCGCGC 
CCCGTCCTTT 
CGTCGTCGCC 
CGCGCGGCGC 
ACCAGCGGTC 
CTCTGGACGC 
GCTCCCGGGG 
CCCCCCATCG 
GGGGGAACCT 
CGCGGCGCCC 
GAATCCGACT 
TGTGATTTCT 
ACGGCGGGAG 
CGCGCATGAA 
GCCAAGGGAA 
AGTCTGGCAC 
GCCCCGTCCT 
CTACTCTCAT 
CGCTTCTGGC 
AGTGCCAGGT 
TAAGGCGAGC 
ATCTTGATTT 
TTTGGGTTTT 
CCAAGCGTTC 
GAAGCAGAAT 
TAGACCGTCG 
CCTGCTCAGT 
GCCAATGGGG 
GCCCAAGCGG 
CCCCGTCCGT 
CCGCCGGGCG 
CGGCCGGAAA 
GGCGCTAAAC 
GCTCCCTCGC 
TTTCCCGTCG 
GCGGTCGCCT 
CGTCTTCTCC 
TGGGTGTGGG 
CTTGCCCTCC 



TGAAACACGG 
GTGGCGCAAT 
CCAGTCCGCG 
CGAGCGTACG 
GGAAACTCTG 
ATAGGGGCGA 
GATAGCTGGC 
GGTCTTGGGG 
CTCGCTGGCG 
GAACTGGCGC 
AGACCCCAGA 
TCCGCTAAGG 
GCTGGAGCGT 
GGCCGCGGGT 
CCCTCCGCGC 
AGTAGGAGGG 
GCAGGTGCAG 
GGAGAAGGGT 
GAGTGCCGTT 
TCGGGTTCAG 
ACGCGACCGA 
AAGGGCAGGG 
TCGCGGTTCC 
TGTAAATCTC 
TGGCATGTTG 
AAGGATTGGC 
GCCGCGGCTG 
CCGCCCGGGC 
ACCTCTCTTC 
GGGCTCCGGG 
CCCGGTGGGG 
GAGCCGGGCC 
AGCCCGGCGG 
CCTCTCCCGA 
CCGCGTCGGT 
CCGCCTCGGC 
GTTTAATTAA 
GCCCAGTGCT 
TAACTATGAC 
TGGATGAACG 
CGGGCTTGGC 
GGTGAAGAGA 
CGCGTCGGGG 
CGTTTTTTCA 
GCCAAGCGTC 
GGGGAGTTTG 
TCAGGGAGGA 
TCAGTACGAA 
AAGCAGGAGG 
ATAGCGACGT 
TCACCAAGCG 
TGAGACAGGT 
ACGAGAGGAA 
CGAAGCTACC 
AACGATACGG 
CCCGCTCGGC 
TCGGGACCGG 
GGGGGCCGCC 
CATTCGTAGA 
TGCGATCTAT 
CACGCCCGCT 
CGGCCCCCGC 
TCCGTCTCCC 
AGCCTCGTGC 
GGCCTTGGCC 



ACCAAGGAGT 
GAAGGTGAAG 
GAGGGCGCAC 
CGTTAGGACC 
GTGGAGGTCC 
AAGACTAATC 
GCTCTCGCTC 
CCGAAACGAT 
TGGAGC CGGG 
TGCGGGATGA 
AAAGGTGTTG 
AGTGTGTAAC 
C GGG C C CAT A 
GCGCGTCTCT 
GCCGGGTTCG 
CCGCTGCGGT 
ATCTTGGTGG 
TCCATGTGAA 
C CGAAGGG AC 
ATCCCCGAAT 
TCCCGGAGAA 
CGCCCTGGAA 
GGCGGCGTCC 
GCGCCGGGCC 
GAACAATGTA 
TCTAAGGGCT 
GACGAGGCGC 
CCGCCCTCCC 
CCCCCTCCTT 
GCGGCGGGTC 
CGGGGGGCCC 
CTTCCCGTGG 
GTGCCGGCGC 
GGTGCGTGGC 
GTTCCCCCGC 
CGGCGCCTAG 
AACAAAGCAT 
CTGAATGTCA 
TCTCTTAAGG 
AGATTCCCAC 
GGAATCAGCG 
CATGAGAGGT 
TCGGGGCACG 
CTGACCCGGT 
CGTCCCGCGC 
ACTGGGGCGG 
CAGAAACCTC 
TACAGACCGT 
TGTCAGAAAA 
CGCTTTTTGA 
TTGGATTGTT 
TAGTTTTACC 
CCGCAGGTTC 
ATCTGTGGGA 
CAGCGCCGAA 
GGGGTCCCCG 
GGTCCGGTGC 
CTCTCGCCCG 
CGACCTGCTT 
TGAAAGTCAG 
CGCTCGCACG 
GCGGTTGCCC 
GAGGACGGTT 
CGTCGCGACC 
AAGCCGGAGG 



CTAACGCGTG 
GGCCCCGCCC 
CACCGGCCCG 
CGAAAGATGG 
GTAGCGGTCC 
GAACCATCTA 
CCGACGTACG 
CTCAACCTAT 
CGTGGAATGC 
ACCGAACGCC 
GTTGATATAG 
AACTCACCTG 
CCCGGCCGTC 
CGGGGTCGGG 
CCCCCGCGGC 
GAGCCTTGAA 
TAGTAGCAAA 
CAGCAGTTGA 
GGGCGATGGC 
CCGGAGTGGC 
GCCGGCGGGA 
TGGGTTCGCC 
GGTGAGCTCT 
GT AC C CAT AT 
GGTAAGGGAA 
GGGTCGGTCG 
CGCCGCCCTC 
CTCTTCCCCG 
CTTCCCGTCG 
CAACCCCGCG 
GGACACTCGG 
ATCGCCTCAG 
GGGTCCCCTC 
GGGGGCGGGC 
CGGGTCCGCC 
CAGCCGACTT 
CGCGAAGGCC 
AAGTGAAGAA 
TAGC C AAATG 
TGTCCCTACC 
GGGAAAGAAG 
GTAGAATAAG 
CCGGCCTCGC 
GAGGCGGGGG 
GTGCGGGCGG 
TACACCTGTC 
CCGTGGAGCA 
GAAAGCGGGG 
GTTACCACAG 
TCCTTCGATG 
CACCCACTAA 
CTACTGATGA 
AGACATTTGG 
TTATGACTGA 
GGAGCCTCGG 
CGTCGCCCCG 
GGAGAGCCGT 
TCACGTTGAA 
CTGGGTCGGG 
CCCTCGACAC 
CGACCGTGTC 
GAACGACCGT 
CGTTTCTCTT 
GCGGCCTGCC 
GCGGAGGAGG 



CGCGAGTCAG 
GGGGGCCCGA 
TCTCGCCCGC 
TGAACTATGC 
TGACGTGCAA 
GTAGCTGGTT 
CAGTTTTATC 
TCTCAAACTT 
GAGTGCCTAG 
GGGTTAAGGC 
ACAGCAGGAC 
CCGAATCAAC 
GCCGCAGTCG 
GGTGCGTGGC 
GTCGGGCCCC 
GCCTAGGGCG 
TATTCAAACG 
ACATGGGTCA 
CTCCGTTGCC 
GGAGATGGGC 
GGCCTCGGGG 
CCGAGAGAGG 
CGCTGGCCCT 
CCGCAGCAGG 
GTCGGCAAGC 
GGCTGGGGCG 
TCCCACGTCC 
CGGGGCCCCG 
GGGGGCGGGT 
GGGGTTCCGG 
GGGGCCGGCG 
CTGCGGCGGG 
CCCGCGGGGC 
GGGCGTGTCC 
CCCCGGGCCG 
AGAACTGGTG 
CGCGGCGGGT 
ATTCAATGAA 
CCTCGTCATC 
TACTATCCAG 
ACCCTGTTGA 
TGGGAGGCCC 
GGGCCGCCGG 
GGCGAGCCCC 
GCGCGACCCG 
AAACGGTAAC 
GAAGGGCAAA 
CCTCACGATC 
GGATAACTGG 
TCGGCTCTTC 
TAGGGAACGT 
TGTGTTGTTG 
TGTATGTGCT 
ACGCCTCTAA 
TTGGCCCCGG 
CGGCGGCGCG 
TCGTCTTGGG 
CGCACGTTCG 
GTTTCGTACG 
AAGGGTTTGT 
GCCGCCCGGG 
GTGGTGGTTG 
TCCCCTTCCG 
GTCGCCTGCC 
GGGATCGGCG 



GGGCTCGTCC 
GGTGGGATCf ' 
CGCGCCGGG 1 ' i 
TTGGGCAGGG 
ATCGGTCGTC 
CCCTCCGAAG 
CGGTAAAGCG 
TAAATGGGTA 
TGGGCCACTT 
GCCCGATGCC 
GGTGGCCATG 
TAGCCCTGAA 
GAACGGAACG 
GGGGGCCCGT 
GCGGAGCCTA 
CGGGCCCGGG 
AGAACTTTGA 
GTCGGTCCTG 
CTCGGCCGAT 
GCCGCGAGGC 
AGAGTTCTCT 
GGCCCGTGCC 
TGAAAATCCG 
TCTCCAAGGT 
CGGATCCGTA 
CGAAGCGGGG 
GGGG AG AC C C 
TCGTCCCCCG 
CGGGGGT CGG 
AGCGGGAGGA 
GCGGCGGCGA 
CGTCGCGGCC 
CTCGCTCCAC 
CGCGCGTGTG 
CGGTTTTCCG 
CGGACCAGGG 
GTTGACGCGA 
GCGCGGGTAA 
TAATTAGTGA 
CGAAACCACA 
GCTTGACTCT 
CCGGCGCCCG 
TGAAAT AC CA 
GAGGGGCTCT 
CTCCGGGGAC 
GCAGGTGTCC 
AGCTCGCTTG 
CTTCTGACCT 
CTTGTGGCGG 
CTATCATTGT 
GAGCTGGGTT 
CCATGGTAAT 
TGGCTGAGGA 
GTCAGAATCC 
ATAGCCGGGT 
GGGTCTCCCC 
AAACGGGGTG 
TGTGGAACCT 
TAGCAGAGCA 
CTCTGCGGGC 
CGTCACGGGG 
GGGGGGGGAT 
TCGCTCTCCT 
GCCGCAGCCC 
GCGGCGGCGA 
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2cgcggcgcg 
jTTGGAGGGG 
3CCGGGGGGG 
jCGGCGGCGA 
3CGGCGCCTC 
ACTTTGTTTT 

cccccccccc 

CATATAGGTC 
GAGCAGATAT 

ATATACTTAT 
TCGACCGATG 
CCGCGACGCG 
TGGAACCTTA 
TACTTTGTCT 
ATAAATTATC 
TTTGTGTTGT 
TTTGTGTTGT 
GTTGGGTTGG 
TTGTTTGCTG 
TACACAAACA 
TATCCCTTTC 
TGTGTGTGTG 
TACTTATAAT 
GCAGACTTCT 
AATAAATACA 
GTTGACCAGT 
AATAGATAGA 
ATTAACCACT 
ATTTGAACTC 
TGGTCTGTCT 
TGCTTTTTTT 
TCTGTAGACC 
CAATTTTGGA 
ATTTTATTAT 
ATTAGTTGGA 
TTGTGGGGCT 
GATTTTTGTA 
TTTCATTGCT 
CCAGTTCCTC 
GATGTGCTAG 
AAAAGTTCTA 
TGTTCTCACT 
AGACATATAT 
TTCCCAGACG 
CACCACAACT 
GGAAAAGCAT 
GGTTGTGAAC 
AGTCAGGGCT 
TGAATGATCC 
ATAAAATAAT 
CTCACAGCAC 
GAGGGGTGGG 
ATGGCCTGGT 
T AC CTGAAGT 



GTGACGCACG 
CGGGAGGGGT 
CGCTCTCTCC 
CGTGCGTACG 
TTCCATTTTT 
TTTTTTTTCC 
CCCCCCGGCG 
TTAAATTCCT 
G AC C AGT ACT 
CCGAAAGTCC 
TTTGGTGTGC 
AGGAGGAGGT 
ATGGAGGTCG 
GCGGGCTCAC 
AGGTCGACCA 
TTTTCTGAAA 
TGATCTAGAT 
TTTGTTTTGT 
GTTGTGTTGT 
GTTGGGTTGT 
TTGTTTTGTG 
TGCACTTTTT 
CTTCTCTCTC 
TGCGTGTGTG 
AATAGGTCGC 
GAGTTCGAGG 
TACATACATA 
TGTCAATCCT 
TGGATAGAGT 
TTTCCCTTTT 
AGGACCCTGG 
GCTGTTTGTT 
TTTCTTCTGA 
AGCCTGGCCT 
GTAAAGGTGT 
TAGACAGAAC 
CCAATTAGTT 
GGGGATCAGG 
AAGATTACTT 
TCATTTCTAT 
CTGCCTTCTG 
TG AAC C AG AG 
ACAAAGTGAT 
CTGCCACCAA 

GCCTTTTGAG 
CTAACCTGTT 
GTAGCAGTTG 
CACCCACCAT 
CTAAACCGAT 
CAGCATGGGA 
GAAATGAATG 
CTCCCCCTCC 
GTGGGGGCAG 
TCTCTGAACT 
CCCTGAGTGA 



GTGG 1 
TTTT 
GCC O 
AGGG 1 
TCCC 



CCCGATG 1 



CGGA 1 
GGAA' 
CCGG' 



jAT 
2CC 
3 AG 
3 AG 
2CC 



;c<: 



2CC 
j'TG 

:at 

jAT 

:aa 
2tg 

3CG 



catc 

AACG 



CCCCACTCCC 



CTTTA 
TGGTA 
TCTCTTTCCC 
CTCTTTTTGA 
CGACCAGTAC 
ACCAGATGTC 
TCTGGACTCT 
GTTGTCCGTC 
ATCGCAGAGG 
TTGTTTTTCT 
TTTGTTTTGT 
GTTGTGTTGG 
GTTGTTTGGT 
TTTTGCGGGT 
TTAAAATAAA 
TTTTTTAAAA 
TGTGTGTGTG 
CGGGTGGTGG 
CCAGCCTGGT 
CATACATACA 
TTAGAATTTT 
GATACAAATA 
TAGGTTTTTT 
CAGGTCAACT 
TGTTTGCTTG 
GACAGTATTT 
CAATCGAACT 
GCTACACCAC 
GAAATCAACT 
GGCTGGTTTG 
TATCTCAACG 
TTCTTAGTCT 
TTCTCTTTCT 
GAAGATGTAG 
AGTTTGGATG 
CTTTAACTTT 
CGCGCTTTGT 
GGTTTTGCTT 
AATAAAATGG 
TGGCTGTTTT 
TAGGACACAC 
GTGGTTGCCT 
GAGCCATCTC 
AGACAGTCTG 
AAGTCTCCAC 
CCCACACTGC 
GGATCTGCAT 
GTTGAGCCTT 
TGATTTCCCT 



GTCG 
CTTC 
GAGG 
GGGC 
GGTO 
CTTT 1 
TTTA 



'TCGGC 
<0 ;cgt 



:ggtgt 

2GAGGT 
TCGACC 

:actct 

jACCAG 
2TCTTT 
CTCTTC 
CTTATATACA 
TCCGGGCGAC 
CGAAAGTGTC 
TTTTTTTTTT 
TTTCACTCAT 
TCGACC AGAT 
GTTTTTCAGT 
TTTGTTTTGT 
GTTGGGTTGG 
TTTGTGTTGT 
CGAACAGTTG 
TTTTTAAAAT 
ATTTTCTTTG 
CGTGCAGCGT 
TAGCTTCCCG 
CTACAGAGGA 
TACATACATA 
GTTTTTAATT 
TAGGTTTTTT 
TTTTTTTCCC 
GGAAAACGTG 
CTTGCTTGCT 
CTCTGTGTAA 
CAGAAATCCT 
TGCCTGGCAT 
AGTTGGTCCT 
GGAGGTTTCT 
GAATGCATGA 
GAGGAAAAAA 
TTCTTTCTTT 
GCATTGCATT 
TCAAGCCGTA 
I 1 T 1* T T T* T 1 T T T 1 
ACATTGAATG 
GACATGGTTT 
GAGGCCAGAA 
CCTTCCCAAG 
TAGACGAGAG 
GGGATTTGAA 
TCCAGCCCTC 
CCCTCTTTGT 
GTATTTATTT 
CTTTCTCCCT 
GTCTTCTTGC 
GTCTATCCAG 
GTGAATTC 



GCGTCCGTCG 
TCGGCGCCAG 
GCCCCTCCTC 
GGAGGCGGAG 
CGACCAGTAC 
AGATGTCCGA 
GGACTCTTTT 
TTGTCCGTCT 
TTCTGAAAAT 
CCCACAGCGA 
TGTAAATAGT 
ACTTTGTTTT 
CCGTCCCCCC 

TCATATAGGT 
GTCAGAAAGT 
TTTGTGTTGT 
TTTGTTTTGT 
GTTGGGTTGG 
TTGGTGTTGT 
TCCCTAACCG 
AAATGCGAAA 
TGTGTGTGTG 
GCGCGCGCTC 
GACTCCAGAG 
ACCCTGTCTC 
CATACATACA 
AATGTGATAG 
TTTCAGTAAA 
CTGTCCATGT 
TTTTCTATAT 
TGCTTGCTTG 
CCTGGTGCCC 
CCTGCCTCTT 
TATTATCATT 
GTTTCGTTAA 
TTTGTTTCCG 
AGGTTAAGGT 
TAAAATAATA 
CTTTCAGATA 
GGGAAAAGCA 
TAATGTTTAT 
TTTCTCCTTC 
TGAGCTTTGT 
CCCTTTCTAT 
CCAAAGTCTT 
GCACAGATCT 
CACCAGATCT 
CTCAGGATCT 
CTACATTCCT 
GGT AT AT C AC 
CTTCGAGCTA 
ATGTTTGGGT 
AGGTCTGTGA 
AGGCTGACTG 



GGGACGGCCG 
GCCTCTGGCG 
TTCGCGCGCC 
AGGGTCCGGC 
TCCGGGCGAC 
AAGTGTCCCC 
TTTTTTTTTT 
TTTACTCCTT 
CCCAGAGGTC 
TTCTCTTTTT 
GTGTACGTTT 

CCTCCCCCCC 
TTTAAATTTC 
CGACCGGTGG 
CTGGTGGTCG 
TTTGTGTTGT 
TTTGTTTTGT 
GTTGGGTTGG 
TGGTTTTGTT 
AGTTTTTTTG 
ATCGACCAAT 
TGTGTGTGTG 
GTTTTATAAA 
GCAGAGGCAG 
GAAAAATGAA 
TACATATGAG 
AGAGATAGAT 
TATGAGGTTG 
GGTTGCTGGG 
ATATAAATAG 
CTTGCTTGCT 
TGAAACTCAC 
GTCTACCTCC 
ATCATTATTA 
TTCATTTGAA 
ATTTGGGTGT 
GAGATGGCTC 
TTGGGCTACG 
AGGAGGTCGG 
TTGTTTGAGA 
TACAATATAG 
TACTTCTACT 
TTTGCTTAAC 
CCGTGCAGGG 
TTGAATAAAG 
TTCCCAGCAT 
CATTGTGGGT 
TCAGAAGACG 
TCTTAAGGCA 
CATATACTCA 
TCTAAATTCT 
GGGGCTGGGG 
ACTATTTGCG 
GCTAGTTTTC 



LH840 
LHOOO 
1 ^,00 0 
L<»0 20 
L 9 0 8 0 
L 91 4 0 
I 9 2 i ) ( < 

1 92 (> 

L 9 3 S ("' 
LS*44d 
19b 0 (J 

19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 

2 010 0 
2 016 0 
20220 
20280 
2 0 340 
2 040d 
20460 
20520 
2 0 5 8 0 
20640 
2 07 no 
M i -' ,"■ 

::Oto.-.o 

20880 
20940 
2100 0 
21060 
2 112 0 
2 1180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
217 8 0 
2184 0 
21900 
2 1 9 6 < ) 
22020 
22080 
22118 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
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(n) MOLECULE TYPE: Genomic DMA 
( i i. i ) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCTGACACGC TGTCCTCTGG CGACCTGTCG TCGGAGAGGT TGGGCCTCCG GATGCGCGCG 6 0 

GGGCTCTGGC CTCACGGTGA CCGGCTAGCC GGCCGCGCTC CTGCCTTGAG CCGCCTGCCG 12 0 

CGGCCCGCGG GCCTGCTGTT CTCTCGCGCG TCCGAGCGTC CCGACTCCCG GTGCCGGCCC 180 

GGGTCCGGGT CTCTGACCCA CCCGGGGGCG GCGGGGAAGG CGGCGAGGGC CACCGTGCCC 24 0 

CGTGCGCTCT CCGCTGCGGG CGCCCGGGGC GCCGCACAAC CCCACCCGCT GGCTCCGTGC 3 00 

CGTGCGTGTC AGGCGTTCTC GTCTCCGCGG GGTTGTCCGC CGCCCCTTCC CCGGAGTGGG 3 60 

GGGTGGC CGG AGCCGATCGG CTCGCTGGCC GGCCGGCCTC CGCTCCCGGG GGGCTCTTCG 42 0 

ATCGATGTGG TGACGTCGTG CTCTCCCGGG CCGGGTCCGA GCCGCGACGG GCGAGGGGCG 4 80 

GACGTTCGTG GCGAACGGGA CCGTCCTTCT CGCTCCGCCC GCGCGGTCCC CTCGTCTGCT 54 0 

CCTCTCCCCG CCCGCCGGCC GGCGTGTGGG AAGGCGTGGG GTGCGGACCC CGGCCCGACC 60 0 

TCGCCGTCCC GCCCGCCGCC TTCGCTTCGC GGGTGCGGGC CGGCGGGGTC CTCTGACGCG 660 

GCAGACAGCC CTGCCTGTCG CCTCCAGTGG TTGTCGACTT GCGGGCGGCC CCCCTCCGCG 72 0 

GCGGTGGGGG TGCCGTCCCG CCGGCCCGTC GTGCTGCCCT CTCGGGGGGG GTTTGCGCGA 7 80 

GCGTCGGCTC CGCCTGGGCC CTTGCGGTGC TCCTGGAGCG CTCCGGGTTG TCCCTCAGGT 84 0 

GCCCGAGGCC GAACGGTGGT GTGTCGTTCC CGCCCCCGGC GCCCCCTCCT CCGGTCGCCG 90 0 

CCGCGGTGTC CGCGCGTGGG TCCTGAGGGA GCTCGTCGGT GTGGGGTTCG AGGCGGTTTG 96 0 

AGTGAGACGA GACGAGACGC GCCCCTCCCA CGCGGGGAAG GGCGCCCGCC TGCTCTCGGT 10 2 0 

GAGCGCACGT CCCGTGCTCC CCTCTGGCGG GTGCGCGCGG GCCGTGTGAG CGATCGCGGT 10 80 

GGGTTCGGGC CGGTGTGACG CGTGCGCCGG CCGGCCGCCG AGGGGCTGCC GTTCTGCCTC 114 0 

CGACCGGTCG TGTGTGGGTT GACTTCGGAG GCGCTCTGCC TCGGAAGGAA GGAGGTGGGT 12 0 0 

GGACGGGGGG GCCTGGTGGG GTTGCGCGCA CGCGCGCACC GGCCGGGCCC CCGCCCTGAA 12 6 0 

CGCGAACGCT CGAGGTGGCC GCGCGCAGGT GTTTCCTCGT ACCGCAGGGC CCCCTCCCTT 13 2 0 

CCCCAGGCGT CCCTCGGCGC CTCTGCGGGC CCGAGGAGGA GCGGCTGGCG GGTGGGGGGA 13 a(> 

GTGTGACCCA CCCTCGGTGA GAAAAGCCTT CTCTAGCGAT CTGAGAGGCG TGCCTTGGGG 14 4 0 

GTACCGGATC CCCCGGGCCG CCGCCTCTGT CTCTGCCTCC GTTATGGTAG CGCTGCCGTA L50' 

GCGACCCGCT CGCAGAGGAC CCTCCTCCGC TTCCCCCTCG ACGGGGTTGG GGGGGAGAAG 15 6 0 

CGAGGGTTCC GCCGGCCACC GCGGTGGTGG CCGAGTGCGG CTCGTCGCCT ACTGTGGCCC 16 2 0 

GCGCCTCCCC CTTCCGAGTC GGGGGAGGAT CCCGCCGGGC CGGGCCCGGC GCTCCCACCC 168 0 

AGCGGGTTGG GACGCGGCGG CCGGCGGGCG GTGGGTGTGC GCGCCCGGCG CTCTGTCCGG 174 0 

CGCGTGACCC CCTCCGTCCG CGAGTCGGCT CTCCGCCCGC TCCCGTGCCG AGTCGTGACC 18 0 0 

GGTGCCGACG AC CGCGTTTG CGTGGCACGG GGTCGGGCCC GCCTGGCCCT GGGAAAGCGT 1860 

CCCACGGTGG GGGCGCGCCG GTCTCCCGGA GCGGGACCGG GTCGGAGGAT GGACGAGAAT 192 0 

CACGAGCGAC GGTGGTGGTG GCGTGTCGGG TTCGTGGCTG CGGTCGCTCC GGGGCCCCCG 1980 

GTGGCGGGGC CCCGGGGCTC GCGAGGCGGT TCTCGGTGGG GGCCGAGGGC CGTCCGGCGT 2 04 0 

CCCAGGCGGG GCGCCGCGGG ACCGCCCTCG TGTCTGTGGC GGTGGGATCC CGCGGCCGTG 210 0 

TTTTCCTGGT GGCCCGGCCG TGCCTGAGGT TTCTCCCCGA GCCGCCGCCT CTGCGGGCTC 2160 

CCGGGTGCCC TTGCCCTCGC GGTCCCCGGC CCTCGCCCGT CTGTGCCCTC TTCCCCGCCC 222 0 

GCCGCCCGCC GATCCTCTTC TTCCCCCCGA GCGGCTCACC GGCTTCACGT CCGTTGGTGG 22 8 0 

CCCCGCCTGG GACCGAACCC GGCACCGCCT CGTGGGGCGC CGCCGCCGGC CACTGATCGG 2 34 0 

CCCGGCGTCC GCGTCCCCCG GCGCGCGCCT TGGGGACCGG GTCGGTGGCG CGCCGCGTGG 24 00 

GGCCCGGTGG GCTTCCCGGA GGGTTCCGGG GGTCGGCCTG CGGCGCGTGC GGGGGAGGAG 24 6 0 

ACGGTTC CGG GGGACCGGCC GCGGCTGCGG CGGCGGCGGT GGTGGGGGGA GCCGCGGGGA 252 0 

TCGCCGAGGG CCGGTCGGCC GCCCCGGGTG CCCCGCGGTG CCGCCGGCGG CGGTGAGGCC 2580 

CCGCGCGTGT GTCCCGGCTG CGGTCGGCCG CGCTCGAGGG GTCCCCGTGG CGTCCCCTTC 2 64 0 

CCCGCCGGCC GCCTTTCTCG CGCCTTCCCC GTCGCCCCGG CCTCGCCCGT GGTCTCTCGT 2 70 0 

CTTCTCCCGG CCCGCTCTTC CGAACCGGGT CGGCGCGTCC CCCGGGTGCG CCTCGCTTCC 2 76 0 

CGGGCCTGCC GCGGCCCTTC CCCGAGGCGT CCGTCCCGGG CGTCGGCGTC GGGGAGAGCC 2 82 0 

CGTCCTCCCC GCGTGGCGTC GCCCCGTTCG GCGCGCGCGT GCGCCCGAGC GCGGCCCGGT 2 880 

GGTCCCTCCC GGACAGGCGT TCGTGCGACG TGTGGCGTGG GTCGACCTCC GCCTTGCCGG 2 94 0 

TCGCTCGCCC TCTCCCCGGG TCGGGGGGTG GGGCCCGGGC CGGGGCCTCG GCCCCGGTCG 3 000 

CTGCCTCCCG TCCCGGGCGG GGGCGGGCGC GCCGGCCGGC CTCGGTCGCC CTCCCTTGGC 3 06 0 

CGTCGTGTGG CGTGTGCCAC CCCTGCGCCG GCGCCCGCCG GCGGGGCTCG GAGC CGGGCT 312 0 

TCGGCCGGGC CCCGGGCCCT CGACCGGACC GGCTGCGCGG GCGCTGCGGC CGCACGGCGC 318 0 

GACTGTCCCC GGGCCGGGCA CCGCGGTCCG CCTCTCGCTC GCCGCCCGGA CGTCGGGGCC 3 24 0 

GCCCCGCGGG GCGGGCGGAG CGCCGTCCCC GCCTCGCCGC CGCCCGCGGG CGCCGGCCGC 330 0 

GCGCGCGCGC GCGTGGCCGC CGGTCCCTCC CGGCCGCCGG GCGCGGGTCG GGCCGTCCGC 3 36 0 

CTCCTCGCGG GCGGGCGCGA CGAAGAAGCG TCGCGGGTCT GTGGCGCGGG GCCCCCGGTG 342 0 
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gtcgtgtcgc gtggggggcg ggtggttggg gcgtccggtt cgccgcgccc cgo ''ccggcc 34 ro 

CCACCGGTCC CGGCCGCCGC ccccgcgccc gctcgctccc tcccgtccgc cggtgcgggg 3540 

CCGGTCCGTG CGTCCGTCCG TCGTCCTCCT CGCTTGCGGG GCGCCGGGCC CGTCCTCGCG - ( ■ 0 

AGGCCCCCCG GCCGGCCGTC CGGCCGCGTC GGGGGCTCGC CGCGCTCTAC CTTACCTACC 3t,f.,0 

TGGTTGATCC TGCGAGTAGC ATATGCTTGT CTCAAAGATT AAGCCATGCA TGTCTAAGTA 3'/::0 

CGCACGGCCG GTACAGTGAA ACTGCGAATG GCTCATTAAA TCAGTTATGG TTCCTTTGGT 3VftO 

CGCTCGCTCC TCTCCTACTT GGATAACTGT GGTAATTCTA GAGCTAATAC ATGCCGACGG 3 84 0 

GCGCTGACCC CCTTCGCGGG GGGGATGCGT GCATTTATCA GATCAAAACC AACCCGGTCA 3 90 0 

GCCCCTCTCC GGCCCCGGCC GGGGGGCGGG CGCCGGCGGC TTTGGTGACT CTAGATAACC 3 96 0 

TCGGGCCGAT CGCACGCCCC CCGTGGCGGC G ACG AC C CAT TCGAACGTCT GCCCTATCAA 4 02 0 

CTTTCGATGG TAGTCGCCGT GCCTACCATG GTGACCACGG GTGACGGGGA ATCAGGGTTC 4 080 

GAT T C C GG AG AGGGAGCCTG AGAAACGGCT ACCACATCCA AGGAAGGCAG CAGGCGCGCA 414 0 

AATTACCCAC TCCCGACCCG GGGAGGTAGT GACGAAAAAT AACAATACAG GACTCTTTCG 4200 

AGGCCCTGTA ATTGGAATGA GTCCACTTTA AATCCTTTAA CGAGGATCCA TTGGAGGGCA 4260 

AGTCTGGTGC CAGCAGCCGC GGTAATTCCA GCTCCAATAG CGTATATTAA AGTTGCTGCA 4 32 0 

GTTAAAAAGC TCGTAGTTGG ATCTTGGGAG CGGGCGGGCG GTCCGCCGCG AGGCGAGCCA 4380 

CCGCCCGTCC CCGCCCCTTG CCTCTCGGCG CCCCCTCGAT GCTCTTAGCT GAGTGTCCCG 444 0 

CGGGGCCCGA AGCGTTTACT TTGAAAAAAT TAGAGTGTTC AAAGC AGGC C CGAGCCGCCT 4 50 0 

GGATACCGCA GCTAGGAATA ATGGAATAGG ACCGCGGTTC TATTTTGTTG GTTTTCGGAA 4 56 0 

CTGAGGCCAT GATTAAGAGG GACGGCCGGG GGCATTCGTA TTGCGCCGCT AGAGGTGAAA 4 62 0 

TTCTTGGACC GGCGCAAGAC GGACCAGAGC GAAAGCATTT GCCAAGAATG TTTTCATTAA 46 80 

TCAAGAACGA AAGTCGGAGG TTCGAAGACG ATCAGATACC GTCGTAGTTC C G AC CAT AAA 474 0 

CGATGCCGAC CGGCGATGCG GCGGCGTTAT TCCCATGACC CGCCGGGCAG CTTCCGGGAA 4 8 00 

ACCAAAGTCT TTGGGTTCCG GGGGGAGTAT GGTTGCAAAG CTGAAACTTA AAGGAATTGA 4 86 0 

CGGAAGGGCA CCACCAGGAG TGGAGCCTGC GGCTTAATTT GACTCAACAC GGGAAACCTC 4 92 0 

ACCCGGCCCG GACACGGACA GGATTGACAG ATTGATAGCT CTTTCTCGAT TCCGTGGGTG 4 98 0 

GTGGTGCATG GCCGTTCTTA GTTGGTGGAG CGATTTGTCT GGTTAATTCC GATAACGAAC 5 04 0 

GAGACTCTGG CATGCTAACT AGTTACGCGA CCCCCGAGCG GTCGGCGTCC CCCAACTTCT 5100 

TAGAGGGACA AGTGGCGTTC AGCCACCCGA GATTGAGCAA TAACAGGTCT GTGATGCCCT 516 0 

TAGATGTCCG GGGCTGCACG CGCGCTACAC TGACTGGCTC AGCGTGTGCC TACCCTACGC 52 2 0 

CGGCAGGCGC GGGTAACCCG TTGAACCCCA TTCGTGATGG GGAT CGGGGA TTGCAATTAT 52 b'. 

TCCCCATGAA CGAGGGAATT CCCGAGTAAG TGCGGGTCAT AAGCTTGCGT TGATTAAGTC 534 0 

CCTGCCCTTT GT AC AC AC CG CCCGTCGCTA CTACCGATTG GATGGTTTAG TGAGGCCCTC S4 0 1 

GGATCGGCCC CGCCGGGGTC GGCCCACGGC CCTGGCGGAG CGCTGAGAAG ACGGTCGAAC 54 6 0 

TTGACTATCT AGAGGAAGTA AAAGTCGTAA CAAGGTTTCC GTAGGTGAAC CTGCGGAAGG 55 2 0 

ATCATTAACG GAGCCCGGAG GGCGAGGCCC GCGGCGGCGC CGCCGCCGCC GCGCGCTTCC 55 80 

CTCCGCACAC CCACCCCCCC ACCGCGACGC GGCGCGTGCG CGGGCGGGGC CCGCGTGCCC 5 64 0 

GTTCGTTCGC TCGCTCGTTC GTTCGCCGCC CGGCCCCGCC GCCGCGAGAG CCGAGAACTC 5 7 00 

GGGAGGGAGA C GGGGGGG AG AGAGAGAGAG AGAGAGAGAG AGAGAGAGAG AGAGAGAGAA 576 0 

AGAAGGGCGT GTCGTTGGTG TGCGCGTGTC GTGGGGCCGG CGGGCGGCGG GGAGCGGTCC 5 82 0 

CCGGCCGCGG CCCCGACGAC GTGGGTGTCG GCGGGCGCGG GGGCGGTTCT CGGCGGCGTC 5880 

GCGGCGGGTC TGGGGGGGTC TCGGTGCCCT CCTCCCCGCC GGGGCCCGTC GTCCGGCCCC 5 94 0 

GCCGCGCCGG CTCCCCGTCT TCGGGGCCGG CCGGATTCCC GTCGCCTCCG CCGCGCCGCT 600 0 

CCGCGCCGCC GGGCACGGCC CCGCTCGCTC TCCCCGGCCT TCCCGCTAGG GCGTCTCGAG 6 060 

GGTCGGGGGC CGGACGCCGG TCCCCTCCCC CGCCTCCTCG TCCGCCCCCC CGCCGTCCAG 612 0 

GTACCTAGCG CGTTCCGGCG CGGAGGTTTA AAGACCCCTT GGGGGGATCG CCCGTCCGCC 6180 

CGTGGGTCGG GGGCGGTGGT GGGCCCGCGG GGGAGTCCCG TCGGGAGGGG CCCGGCCCCT 6 24 0 

CCCGCGCCTC CACCGCGGAC TCCGCTCCCC GGCCGGGGCC GCGCCGCCGC CGCCGCCGCG 6300 

GCGGCCGTCG GGTGGGGGCT TTACCCGGCG GCCGTCGCGC GCCTGCCGCG CGTGTGGCGT 6 360 

GCGCCCCGCG CCGTGGGGGC GGGAACCCCC GGGCGCCTGT GGGGTGGTGT CCGCGCTCGC 64 2 0 

CCCCGCGTGG GCGGCGCGCG CCTCCCCGTG GTGTGAAACC TTCCGACCCC TCTCCGGAGT 64 80 

CCGGTCCCGT TTGCTGTCTC GTCTGGCCGG CCTGAGGCAA CCCCCTCTCC TCTTGGGCGG 6 54 0 

GGGGGGCGGG GGGACGTGCC GCGCCAGGAA GGGCCTCCTC CCGGTGCGTC GTCGGGAGCG 6 60 0 

CCCTCGCCAA ATCGACCTCG TACGACTCTT AGCGGTGGAT CACTCGGCTC GTGCGTCGAT 6 66 0 

GAAGAACGCA GCTAGCTGCG AGAATTAATG TGAATTGCAG GACACATTGA TCATCGACAC 6 72 0 

TTCGAACGCA CTTGCGGCCC CGGGTTCCTC CCGGGGCTAC GCCTGTCTGA GCGTCGCTTG 678 0 

CCGATCAATC GCCCCGGGGG TGCCTCCGGG CTCCTCGGGG TGCGCGGCTG GGGGTTCCCT 6 84 0 

CGCAGGGCCC GCCGGGGGCC CTCCGTCCCC CTAAGCGCAG ACCCGGCGGC GTCCGCCCTC 6 90 0 

CTCTTGCCGC CGCGCCCGCC CCTTCCCCCT CCCCCCGCGG GCCCTGCGTG GTCACGCGTC 6 96 0 

GGGTGGCGGG GGGGAGAGGG GGGCGCGCCC GGCTGAGAGA GACGGGGAGG GCGGCGCCGC 702 0 

CGCCGGAAGA CGGAGAGGGA AAGAGAGAGC CGGCTCGGGC CGAGTTCCCG TGGCCGCCGC 70 80 

CTGCGGTCCG GGTTCCTCCC TCGGGGGGCT CCCTCGCGCC GCGCGCGGCT CGGGGTTCGG 714 0 

GGTTCGTCGG CCCCGGCCGG GTGGAAGGTC CCGTGCCCGT CGTCGTCGTC GTCGCGCGTC 72 0 0 

GTCGGCGGTG GGGGCGTGTT GCGTGCGGTG TGGTGGTGGG GGAGGAGGAA GGCGGGTC CG 72 6 0 

GAAGGGGAAG GGTGCCGGCG GGGAGAGAGG GTCGGGGGAG CGCGTCCCGG TCGCCGCGGT 7320 
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TCCG-; 



jTG GCGGCCCGGC GTCCGGCCGA CCGGCCGCTC CCCGCGCCC-: 



lLCT«„CI<_L-„ L-j-jjy.LVUr CCTCCGAGGC ^C^G^fVGTr rTiTTrr^ri' 1 T p ( v PP p ™ I , ? 

tacgggcggg r',^ C( -^rvY^ nTl -, (TTrrr r~;";;;";; u -- TU ~'- c, - ( ^'- 'iio 

r _ ;^;^-^;^ J ^ ( -cggltlgl ltu^ggcgc gtcggccggg gcggggagcv< - jn 

CGClLC<^u, L.CCGCCCGTG GCCGCGGCGC CGGGGTTGGG GTGTCm-rr rrrrrA^PP 
CGGGACGCCG CGGTGTCGTC CGCCGTCGCG CGCcSfTr GGGctS^GG CCG^CrGG 7 

sss s~ ~i i 5 

=S SSS = i I 

GGAGCGTCGG GGGGGCGGTC GGGCCGGCGC GATTCCGTCC GTCCCTCCGC CCACrrrrrr I n 
GTCCrrrTrr _ ' 1 - ~ biL ^^LGC CGAGCGGCCC 7y20 

7 • j y o 

8040 
8100 



8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
91?') 
U> ) 
9 2 4') 



GTCCCCCTCC GAGACGCGAC C^GATCAG AC^GGCGAC CCGCTgS?5 
AGTCAGCGGA GGAAAAGAAA CTAACCAGGA TTCCCTCAGT AACGGCCAPT JSSH 
°^ CAGCG CCG ^TCCCC GCCCCGCGGG GCGCGGGaS tcSgCctS gSa^GAC^ 
CTCCCCGGCG CCGCTCGTGG GGGGCCCAAG TCCTTCTGAT CGAGGCCCAG ic^TCCArr o^n 

gtgtgaggcc ggtagcggcc ggcgcgcgcc cgggtcttcc cgSag?cggg C 

^™^ CCCA ^CGGGTGG TAAACTCCAT CTAAGGCTAA SSSgSS 

gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagag?? SgS 

TGAAACCGTT AAGAGGTAAA CGGGTGGGGT CCGCGCAGTC CGCCCGGAGG A^cScr 

gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgc?ccc?g 

CCCCTCCACC CGCCCTCCCT TCCCCCGCCG CCCCTCCTCC TCCTCCcSg AGGGGGCVrr^ 
^^ CGGG TGCGGGGGTG GGCGGGCGGG GCCGGGGGTG GGGTCGGCGG GGcScOTCr 
CCCGACCGGC GACCGGCCGC CGCCGGGCGC ATTTCCACCG CGGCGGTGCG rrrrrS 
CTCCGGGACG GCTGGGAAGG CCCGGCGGGG AAGGTGGCTC SgGGgSc? GTCCGTCCPT 
CCGTCCTCCT CCTCCCCCGT CTCCGCCCCC CGGCCCCGCG TCCTCcScS Srrrrr 
CGGGTCGGGG CGGCGGCGGC GGCGGCGGTG GCGGCGGCGG CGGGGGCGGC GgSScoaaa 

ccccccccga gtgttacagc ccccccggca gcagcactcg ccgaS?cc?g gggJSS^ 
agcgagaccc gtcgccgcgc tctcccccct cccggcgccc acccccgcS 
gcgagggggg tctcccccgc gggggcgcgc cggcgtctcc tStSggggg Sr™ 

CCCTCCCACG GCGCGACCGC TCTCCCACCC CTCCTCCCCG CGcSSgS CCGGCGArrr 

gSgcgcS? Sgg?cS rrt^ CGG ggcggagtg t SccaSgcg c^gSSg 

i^ic^CGCCGT CGGGCCCGGG GGAGGTTCTC TCGGGGCCAC GCGCGCGTrr rrrnnnnKt-n 
T^n^^ AGCGAGCGG A CGGGGT CGGC GGCGACGTCG GC?SSc? cSS?S?S 
™^ACGG ACCAAGGAGT CTAACACGTG CGCGAGTCGG GGGCTCGCAC SSSSS 
GTGGCGCAAT GAAGGTGAAG GCCGGCGCGC TCGCCGGCCG AGGTGGGATC CCGAGrrrTP 

IcgScag g?S?S? S a ^ ggccc gtgtcggggg SSESSSS SSSSSS 

t^™, GTGTTAGGAC CCGAAAGATG GTGAACTATG CCTGGGCAGG rrrasrnran 

AGGAAACTCT GGTGGAGGTC CGTAGCGGTC CTGACGTGCA AATCGGTCCT TCC^CTrrr 
^I A ^ GGCG AAAGAC TAAT CGAAC CATC T AGTAGCTGGT TCCCTCCGA^ SSicSJS 
^ A ^ CTGG CGCTCTG GCA GACCCGACGC ACCCCCGCCA CGCAGTTTTA tcSctSS 
t 0 ™™ G AGGTC TTGG GGCCGAAACG ATCTCAACCT ATTCTCAAAC SSKSSSS 
E£S£ GCCC GGCTCGCTGG CGTGGAGCCG GGCGTGGAAT GCGAGTGCCT AGTgSSS 
TTTTGGTAAG CAGAACTGGC GCTGCGGGAT GAACCGAACG CCGGCTtSg ScSc^GATC 
CCGACGCTCA TCAGACCCCA GAAAAGGTGT TGGTTGATAT AGACAGC^ ACGGTGCCPA 
l^nl^ G AATCCGCTAA GGAGTGTGTA ACAACTCACC TGCcSS ic^lcCCTC 
AAAATGGATG GCGCTGGAGC GTCGGGCCCA TACCCGGCCG TCGCCGgSg tcS^AP^ 

AGGG i AGCGG ggggg gcggc gcgcgcgcgc gcgcgtgtgg tgtgcgtSS IggSSgcS? 
^,2^, CGGC ggcgggggtg tggggtcctt cccccgcccc cccccccKS Jc??c??cc? 
tcctcccgcc cacgccccgc tccccgcccc cggagccccg cggacgctac Sc??SS^a 

rln^n^ 00 CGCTGCGG TG AGCCTTGAAG CCTAGGGCG? gS?gS5 GGAgSSSS 

caggtgcaga tcttggtggt agtagcaaat attcaaacga gaactt?gaI gg??gSg?g 
SS cgaagggacp A ^ TGAA gatggg ^ gag tSSSSS gaga?Sg 

^'f CGAAGGGACG GGCGATGGCC TCCGTTGCCC TCGGCCGATC riaarr^rT 

CGGGTTCAGA TCCCCGAATC CGGAGTGGCG GAGATGGGCG CCGCGAGGCG r^rrrrrr 
™^ ACC GATCCCGGAG AAGCCGGCGG GAGCCCCGGG GAGAGTTCTC 
GAAGGGCAGG GCGCCCTGGA ATGGGTTCGC CCCGAGAGAG GGGCCCGTGC SJSES? 

gJgSS cg?g^ggc cS^^ C tcgctggggg t?ga^t?c 
^l^^t^ 11 - 1 cg cgccgggc cgtacccata tccgcagcag gtctccaagg tgaaparrrt 

CTGGCATGTT GGAACAATGT AGGTAAGGGA AGTCGGCAAG CCGGATCCGT ^CTTCGGgI 
TAAGGATTGG CTCTAAGGGC TGGGTCGGTC GGGCTGGGGC GCGAAGCGgS 

S££?2S SS GGCGGC gg acgcccgg gg^cS? ?SSgS? 

Sjjc c ?c c c c c c § = c c ^G C Sc c sss sssss 

ggggagaagg gtcggggcgg caggggccgc gcggcggccg ccggggcgg? ?GG?Scr?r 

AGGTCCCCGC GAGGGGGGCC CCGGGGACCC GGGGGGCCGG CGGCGGCGCG 
GCGAGCCGGG CCCTTCCCGT GGATCGCCCC AGCTGCGG^G GGCCTCgSg ^JSc^GG 



9360 
94 2 0 
94 8 V 
9540 
9600 
966 ; 
9720 
9780 
9840 
9900 
9960 
10020 
1008U 
10140 
10200 
1026;- 
10320 
10 380 
10440 
10 50 0 
10560 
1062O 
1068O 
1074O 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 



-198- 



GGAGCCCGGC 
GTCCGCTGGG 
TCGTCCCCCC 
GCGCGGCGGC 
CGCCCCCGGG 
AGAACTGGTG 
CGCGGCGGGT 
ATTCAATGAA 
CCTCGTCATC 
TACTATCCAG 
ACCCTGTTGA 
TGGGAGGCCC 
CCTGCGGGCC 
GGGGGGGCGA 
ACCCGCTCCG 
GTAACGCAGG 
GCAAAAGCTC 
CGATCCTTCT 
ACTGGCTTGT 
TCTTCCTATC 
AACGTGAGCT 
TGTTGCCATG 
GTGCTTGGCT 
TCTAAGTCAG 
TCGGATAGCC 
GCCGCGGGAG 
TCGTCCTGGG 
CACGTTCGTG 
TTCGTACGTA 
GGGTTTGTCC 
CCGTCCTTCC 
GGGTGGGGGG 
TCCCCGTTCA 
GTTTGGGAGC 
CGGGGGTTGG 
TCCTCCTCGC 
GAGCCCCACG 
CCTCCGGGGT 
CGGGTCGACC 
GCAACCGGAG 
GCTCCGGACT 
GCAGCGGCGC 
CTCGGCCCGC 
CACCGCGAGT 
CTGCCTCCTC 
TTTTTTCTTT 
CTTCTTGTGT 
CTCTCGCTCT 
TCGCCCTCTC 
GTCGCTCTCG 
CCCTCCCTCC 
GCCGCTGTCT 
CAGTGCCCGT 
TCGACCAGCT 
ACCAGCAGGC 
GCGCCCGCGC 
TCCTACATTT 
ACTGCTGCCG 

TTCTTTCTTT 
TTTCTCTCTC 
TTTCTCTCTC 
TCCCCCTCCC 
TGTCTCGCCG 
CCCGTCGGGA 



GGCGGCGCGG 
GGCGGGAGCG 
GCCCTACCCC 
GGCGGCGGCA 
GCCGCGGTTC 
CGGACCAGGG 
GTTGACGCGA 
GCGCGGGTAA 
TAATTAGTGA 
CGAAACCACA 
GCTTGACTCT 
CCGGCGCCCC 
GCCGGTGAAA 
GCCCGAGGGG 
GGGACAGTGC 
TGTCCTAAGG 
GCTTGATCTT 
GACCTTTTGG 
GGCGGCCAAG 
ATTGTGAAGC 
GGGTTTAGAC 
GTAATCCTGC 
GAGGAGCCAA 
AATCCCGCCC 
GGTCCCCCGC 
GGCGCGTGCC 
AAACGGGGCG 
GGGAACCTGG 
GCAGAGCAGC 
GCGCGCGCGT 
GTTCGTCTTC 
GAGGGCGCGC 
CGCCGGGGCG 
CGCGGAGGCG 
CCGCGCGGCG 
TCCTCCGCAC 
GGCGTCCCCG 
CGACCGCCTG 
GCGGCCTTCT 
CGTCCCCGTC 
TAGCCGGCGT 
ACGCACGCGA 
GGTGGAGCTG 
TTGCGTCCGC 
CTTTTTCGCT 
CTTTCTTTCT 
TCTCTTCTTG 
CGCCCTCTCT 
TCTCTCTCTT 
CCCTCTCGCT 
CTCCCTCCCT 
CGCCGTACCC 
CGGGACGAGC 
GCCGCCCGCG 
GGCCGCCGGA 
CTCCACCGGC 
TTTTCAGCCC 
TCAGCCAGTA 

CTTTCGCTCT 
TCTCTCTCTC 
TCTCTGTCTC 
TCCCTCTCTC 
TGTCCCGGGT 
CGAGCCGGAC 



C G C G C C C < 'J C C 
GTCGGGCGGC 
CCCGGCGCCG 
GGCGGCGGAG 
CGCGCGCGCC 
GAATCCGACT 
TGTGATTTCT 
ACGGCGGGAG 
CGCGCATGAA 
GCCAAGGGAA 
AGTCTGGCAC 
CCCGGTGTCC 
TACCACTACT 
CTCTCGCTTC 
CAGGTGGGGA 
CGAGCTCAGG 
GATTTTCAGT 
GTTTTAAGCA 
CGTTCATAGC 
AGAATTCGCC 
CGTCGTGAGA 
TCAGTACGAG 
TGGGGCGAAG 
AGGCGAACGA 
CTGTCCCCGC 
CCGCCGCGCG 
CGGCCGGAAA 
CGCTAAACCA 
TCCCTCGCTG 
GCGTGCGGGG 
CTCCCTCCCG 
GACCCCGGTC 
GCTCGTCCGC 
CCGCGCCGAG 
CGGTGGGGGG 
GGGTCGACCG 
CACCCGGCCG 
CGCCCGCGGG 
CCACCGAGCG 
TCGGTCGGCA 
CTGCACGTGT 
GGGCGTCGAT 
GGACCACGCG 
GGGACCTTTA 
TTTAGGTTTT 
TTCTTTCTTT 
CTCTTCCTCT 
CTCTTCTCTC 
CTCTCTGTCT 
CTCTCTCTGT 
CCCTCCCCTT 
CGGGTCGACC 
CGGACCCGCC 
AGCTCCGGAC 
CGCAGCGGCG 
CTCGGCCCGC 
CACCGCGAGT 
CTGCCTCCTC 
TTTTTTCTTT 
CGCTCTCTCG 
TCTCTCTCTC 
TCTCTCTCTC 
CCCTTCCTTG 
CGACCGGCGG 
CCGCCGCGTC 



ACCCCCACCC 
GGCGGTCGGC 
TCCGCCCCCC 
GGGCCGCGGG 
TCGCCTCGGC 
GTTTAATTAA 
GCCCAGTGCT 
TAACTATGAC 
TGGATGAACG 
CGGGCTTGGC 
GGTGAAGAGA 
CCGCGAGGGG 
CTGATCGTTT 
TGGCGCCAAG 
GTTTGACTGG 
GAGGACAGAA 
ACGAATACAG 
GGAGGTGTCA 
GACGTCGCTT 
AAGCGTTGGA 
CAGGTTAGTT 
AGGAACCGCA 
CTACCATCTG 
TACGGCAGCG 
CGGCGGGCCG 
CCGGGACCGG 
GGCGGCCGCC 
TTCGTAGACG 
CGATCTATTG 
GGCCCGGCGG 
GCCTCTCCCG 
GGCCGCCCCG 
TCCGGGCCGG 
CCGGGCCCCG 
CCACCCGGGG 
ACGAACCGCG 
ACCTCCGCTC 
CGTGAGACTC 
GCGGTGTAGG 
CCTCCGGGGT 
CCCGGGTCGA 
TCCCCTTCGC 
GAACTCCCTC 
AGAGGGAGTC 
GCTTGCCTTT 
CTTTCTTTCT 
GTCTGTCTCT 
TCTCTCTCTC 
CTCTCTCTCT 
CTCTGTCTGT 
CCTTGGCGCC 
GGCGGGCCTT 
GCGTCCCCGT 
TTAGCCGGCG 
C AC CGACGGA 
CGTGGAGCTG 
TTGCGTCCGC 
CTTTTTCGCT 
CTTTCTTTCT 
CTCTCTCCCT 
TCTGTCTCTC 
TCTCTCTCTC 
GCGCCTTCTC 
GCCTTCTCCA 
CCCGTCTCGG 



CACGTCTCGG 
GGGCGGCGGG 
GTTCCCCCCT 
CCGGTCCCCC 
CGGCGCCTAG 
AACAAAGCAT 
CTGAATGTCA 
TCTCTTAAGG 
AGATTCCCAC 
GGAATCAGCG 
CATGAGAGGT 
CCCGGGGCGG 
TTTCACTGAC 
CGCCCGCCCG 
GGCGGTACAC 
ACCTCCCGTG 
ACCGTGAAAG 
GAAAAGTTAC 
TTTGATCCTT 
TTGTTCACCC 
TTACCCTACT 
GGTTCAGACA 
TGGGATTATG 
CCGCGGAGCC 
CCCCCCCCTC 
GGTCCGGTGC 
CCCTCGCCCG 
ACCTGCTTCT 
AAAGTCAGCC 
GCGTGCGCGT 
CCGACCGCGG 
CTTCTTCGGT 
GACGGGGTCC 
TGGCCCGCCG 
TCCCGGCCCT 
GGTGGCGGGC 
GCGACCTCTC 
AGCGGCGTCT 
AGTGCCCGTC 
CGACCAGCTG 
CCAGCAGGCG 
GCGCCCGCGC 
TCCCACATTT 
ACTGCTGCCG 

TTCTTTCTTT 
CTCTCTCTCT 
TCTCTCTCTG 
CTCTCTCTCT 
GTCTCTCTCT 
TTCTCGGCTC 
CTCCACCGAG 
CTCGGTCGGC 
TCTGCACGTG 
GGGCGCTGAT 
GGACCACGCG 
GGGACCTTTA 
TTTAGGTTTT 
TTCTTTCTTT 
CGCTCGTTTC 
GCTCTCGCCC 
TCTCTCTCTC 
GGCTCTTGAG 
CCGAGCGGCG 
TCGGCACCTC 



TCGCGCGCG:' ' 

gcgggg-'xjgt 

CCTCCTCGG ' 
CCGCCGGGTC 
CAGCCGACTT 
CGCGAAGGCC 
AAGTGAAGAA 
TAGCCAAATG 
TGTCCCTACC 
GGGAAAGAAG 
GTAGAATAAG 
GGTCCGCGGC 
CCGGTGAGGC 
GCCGGGCGCG 
CTGTCAAACG 
GAGCAGAAGG 
CGGGGCCTCA 
CACAGGGATA 
CGATGTCGGC 
ACTAATAGGG 
GATGATGTGT 
TTTGGTGTAT 
ACTGAACGCC 
TCGGTTGGCC 
CACGCGCCCC 
GGAGTGCCCT 
TCACGCACCG 
GGGTCGGGGT 
CTCGACACAA 
TCGGCGCCGT 
CGTGGTGGTG 
TCCCGCCTCC 
GGGGAGCGTG 
GTCCCCGTCC 
CGCGCGTCCT 
GGCGGGCGGC 
CTCGGTCGGG 
CGCCGTGTCC 
GGGACGAACC 
CCGCCCGCGA 
GCCGCCGGAC 
CTCCACCGGC 
TTTTCAGCCC 
TCAGCCAGTA 

CGCTTGTCTT 
CTCTCTCTGT 
TCTCTCGCTC 
CTCTCTCTCT 
CTCCCTCCCT 
TTGAGACTTA 
CGGCGTGCCA 
ACCTCCGGGG 
TCCCGGGTCG 
TCCCGTTCAC 
GAACTCCCTC 
AGAGGGAGTC 
GCTTGCCTTT 
CTTTCTTTCT 
TTTCTTTCTC 
TCTCTCTCTC 
CCTCCCTCCC 
ACTTAGCCGC 
TGCCACAGTG 
CGGGGTCGAC 
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CAGCTGCCGC CCGCGAGCTC CGGACTTAGC CGGCGTCTGC ACGTGTCCCG GGTCGACCAG 

LAGGCGGCCG CCGGACGCTG CGGCGCACCG ACGCGAGGGC GTCGATTCCG GTTCACGrrv- 

rACATTTTTT TCKrrrrrlr ^'; CCGCGGT '^AGCTGGGA CCACGCGGAA CTCCCTCTCC V.iOO 

^S^ TTTTT TCAG CCCCAC CGCGAGTTTG OGTCCGCGGG ACTTTTAAGA GGGAGTCACT 1<t.;o 

S^SSS^ ^™ TC CTTCCTCCTT TTTTGCTTTT TGGTTTTGCC TTG^gSS 1.4.20 



ZZZ^J^ZZ Z^lZl™ ZII'Z T I CTT TCTTTCTTTC TCTCTCTCTC TCTCTCTCTC 1-430 
JES™ ----- TGGTCTCGAA C^CGACCT SS SgJ 



CTCTGTCTC TCTCTCTCTC TCTCTCTCCC CTCCCTCCCT CCTTGGTGCC TTCTCGGCTC 15^0 

CTCCAG GGTT CAAGCAAACA GCAAGTTTTC TATTTCGAGT 1' 



CCGCCTCGGC CTCCCAAAGA CTGCTGGGAG TACAGATGTG AGCCACCATG CCCGGCCGAT 1*7- n 
TCCTTCCTTT TTTCAATCTT ATTTTCTGAA CGCTGCCGTG TATGAACATA CATCTaSKI ^780 

CACACACACA CACACACACA CACACACCCC GTaSSSS SIo 



AACTATGTAA ATGATATTTC CATAATTAAT ACGTTTATAT TATGTTACTT TTAATGGATG 15QO0 

AATATGTATC GAAGCCCCAT TTCATTTACA TACACGTGTA TGTATATCCT TCCTCCCTTC lUrl 

CTTCATTCAT TATTTATTAA TAATTTTCGT TTATTTATTT TCTTTTCTTT TGGGGCCGGC 160 n 

CCGCCTGGTC TTCTGTCTCT GCGCTCTGGT GACCTCAGCC TCCCAaSS SSgGACTAC leSsS 

AGGGATCTCT TAAGCCCGGG AGGAGAGGTT AACGTGGGCT GTGATCGCAC ACTTCCaSc llllo 

CAGCTTACGT GGGCTGCGGT GCGGTGGGGT ggggtggggt ggggtggggt gcagagaS Itloo 

cgattgattg cgatctcaat tgccttttag cttcattcat accctgttat ttgctcct?? ItlTo 

attctcatgg gttcttctgt gtcattgtca cgttcatcgt ttgcttgcct GCTTGCCTCT 16^0 

ttatttcctt CCTTCCTTCC ttccttcctt ccttccttcc ttccttcctt ccctccct?a lllto 

ctggcagggt cttcctctgt ctctgccgcc caggatcacc ccaacctcaa cStSSK? llllo 

cgaccaaacg gtcgttctgc ctctgatccc tcccatcccc attacctgag actacaggcg ItHl 

cgcaccacca caccggctga cttttatgtt gtttctcatg ttttccgtag gtaggt*a?c? lllll 

gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt StctcJatc? 111*0 

atgtatgtac gtatgtatgt atgtatgtga gtgagatggg tttcggggtt ctatoSSS llllo 

GCCCACGCTG GTCTCGAACT CCTGTCCTCA AGCAATCCGC CTGCCTGCCT CGGcSScI 16 

CACTGCTGCT ATTACAGGCG TGAGACGCTG CGCCTGGCTC CTTCTACATT TG?CTGCCTC 168$ S 

CCTGCCTGCC TGCCTGCCTA TCAATCGTCT TCTTTTTAGT ACGGATGTCG TCTCGctSa 1686 0 

TTGTCCATGC TCTGGGCACA CGTGGTCTCT TTTCAAACTT CTATGATTAT tSSSS lll-O 

GGCGTCATCT CACGTGTCGA GGTGATCTCG AACTTTTAGG CTCCAGAGAT CCTCC*CgSEt" llllo 
CGGCCTCCCG GAGTGCTGTG ATGACACGCG TGGGCACGGT ACGCTCTGGT CGTGTTTGTC 

GTGGGTCGGT TCTTTCCGTT TTTAATACGG GGACTGCGAA CGAAGAAAAT TTTcIgISc 7?i ■ 

ATCTCACCGA TCCGCCTTTT CGTTCTTTCT TTTTATTCTC TTTAGACGGA GTTTCACTCT 17 0 

TGTCGCCCAG GGTGGAGTAC GATGGCGGCT CTCGGCTCAC CGCACCCTCC SSSSS 17^0 

^^ GTGAT TCTCCTGCCT CAGCCTTCCC GAGTAGCTGG AATGACAGAG ATGAGCCATC \r>ll 

GTGCCCGGCT AATTTTTCTA TTTTTAGTAC AGATGGGGTT TCTCCATCTT GGTcStc I73I 

GTCTTCAACT TCCGACCGTT GGAGAATCTT AACTTTCTTG GTGGTGGTTG TtScctS? lllol 

TCTTTTTTTT TCTTTTCTTT TCTTTCCTTC TCCTCCCCCC CCCACCCCCC TTGTCGTCGT ^7460 

CCTCCTCCTC CTCCTCCTCC TCCTCCTCCT CCTCCTCCTC CTCCTCCTCC TCmSSS ^7520 

CTTTCAGCTG GGCTCTCCTA CTTGTGTTGC TCTGTTGCTC ACGCTGGTCT CAAACTCCTG llllo 

GCCTTGACTC TTCTCCCGTC ACATCCGCCG TCTGGTTGTT GAAATGAGCA TC^TCOTaI Villi 

AATGGAAAAG ATGAAAGAAA TAAACACGAA GACGGAAAGC ACGGTGTGAA SSSSSJ llltl 

GCCGTCTCCC GGGGTGTACC TTGGACCCGG AAACACGGAG GGAGCTTGGC TGAGTGGGTT 17760 

TTCGGTGCCG AAACCTCCCG AGGGCCTCCT TCCCTCTCCC CCTTGTCCCC GCTT^G^ ^7820 

CAGCCGAGGC TCCCACCGCC GCCCCTGGCA TTTTCCATAG GAGAGGTATG GGAGA^ACT Villi 

GACACGCCTT CCAGATCTAT ATCCTGCCGG ACGTCTCTGG CTCGGCGTGC CCoSoSS Villi 

ACCTGCCACC TTCCAGGGAG CTCTGAGGCG GATGCGACCC CCACCCCCCC GTCACCTCCC Ifinnn 

GCTACCCTCC CCCGGCTGGC CTTTGCCGGG CGACCCCAGG GGAACCGCGT ^gSgCTgS llllo 

TCGGATCCTC CGGCGAAGAC TTCCACCGGA TGCCCCGGGT GGGCCGGTTG GGATCAGACT Hill 

GG ^CACCCC GGACCGTGCT GTTCTTGGGG GTGGGTTGAC GTACAGGGTG GActScaS Hill 

5£™ GCATTG TAAAGGGTGC GTGGGTATGG AAATGTCACC TAGGATGCCC TCCTTCCCTT 18240 

G ^ GTGCCT TCAGCTGCCT CAGGCGTGAA GACAACTTCC CATCGGAACC TCTTCTcSS lllil 

CCTTTCTCCA GCACACAGAT GAGACGCACG AGAGGGAGAA ACAGCTCAAT AGATACCGCT llllo 

GACCTTCATT TGTGGAATC C TCAGTCATCG ACACACAAGA CAGGTGACTA GGCaSgGAcI 1842^ 

CAGATCAAAC ACTATTTCCG GGTCCTCGTG GTGGGATTGG TCTCTCTCTC TCTCTCTCTC lltlo 

TCTCTCTCTC TCTCTCTCTC TCTCGCACGC GCACGCGCGC ACACACACAC ACAaStcCA ^8S40 

TATCTAGTTC ACAGAGCACA CTCACTTCCC CTTTTCACAG TACGCAGGCT GACTaIa^TO IIsoS 

cgccccaccc tccacccgtt ggctgacgaa accccttctc tacaattgat gaaaKa^?g llllo 
atctgggccg ggcacgctag ctcacgcctg tcactccggc actttgggag GCCGAGGCGG 111 0 
gtggatcgct tggggccggg agttcgagac caggctggcc gacgtggcSa ScSStc? llifo 

CTCTGAAAAA TAGAACGATT AGCCGGGCCT GGTGGCGTGG GCTTGGAATC ACGACCGCTC llllo 

gggagactgg ggcgggcgac ttgttccaac cggggaggcc gaggccgcga ?gaSc?gaga 

TCGTGCCGTG GCGATGCGGC CTGGATGACG G AGCGAGAC C CCGTCTCGAG AGAAtJaTGA llllo 
TGTTATTATA AGATGAGTTG TGCGCGGTGA TGGCCGCCTG TAGTCGCGGC TACTCGGGaS 190^ 
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GCTGAGACGA 
CCACTGTATC 
TTTCTTCCGT 
TTCTTTCTTT 
TGCCTTTGTT 
CTCCCAAAGT 
TCTTGGAAAG 
GCATCTCGCT 
GCTCCTGGAC 
GGCACGCGCC 
TCACTCTCGT 
CTCCCGGTTT 
CGCTGCCACG 
GGGCTGGTTT 
GGATGACAGG 
GGTCTCATCA 
GTGAGCCACT 
TTTCTTTCTT 
TGAATTATCT 
GCGAGGCGAG 
CCATTCGTAT 
CTTGGGCTTC 
GCTGAGGATT 
CCGACACATC 
CTTGCAGGCT 
GAATGAGGGT 
TCGATTTAGT 
CGGCTAAATA 
GGGCAGAACG 
GAGACAGCTT 
GGTGATGAAA 
TGCACATGTA 
GAGAGAGAGA 
CACCACCTCC 
ATGCAGTATT 
GAGTCTCTCT 
AACCTCTGCT 
CATGACGGCG 
GCTGGTCTCG 
AACGACAGGC 
CTGTGGCCCT 
CCTAGGCCAT 
GCGCACATAA 
TCTCTACACG 
GTACACTGAA 
GGCCAACGTG 
GCAGGCACCT 
CAGGCGGAGG 
GAGACCCGGT 
ATACATACAT 
AAAAGAAAGA 
TGTCTGTTTC 
TCTCTTTCTT 
TCTCACTGTG 
CTCTCCCTCC 
GTCTGTTTCT 
GTCTGCCTCT 
GTCTGTCTCT 
TCTGTCTGTC 
CCTGTCTGTT 
CATCTCTCTC 
TCTCTCTCTC 
TCTGTCTTTC 
GGGCCCTGCC 
GGAATCTAGA 



GGAGAAGATC 
CTGGGCAGTC 
TCTCTTTTCT 
CTTTCTTTCT 
CTTTTCTTCT 
GCTGGGATGA 
TGAGACGCAG 
CCGTCACCCC 
TCGAGCGATC 
ACTGTGCCCA 
GGCCTAGACT 
CAAGCGATTC 
TCTGGCTGAT 
CGAACTCCCG 
CGTGAGCCAC 
TTTCTTTGCA 
GCGCCTGGAC 
TCTTTCTTTC 
TATGATTTAT 
GCGAGGCACA 
GCACAGAGCC 
TCTCCATTCG 
AGGGGGTGTG 
CCCCGACCGC 
CACCTCTTAC 
GTGTGTGGGG 
GTCATGCCTC 
CCGCGTGTTC 
AGGGGG AC C G 
CAGGAAGAAA 
TCATCTGCAC 
TCGCTTGAAC 
GACAGAGAGA 
TTGACCTGAG 
TGGGCCCGTT 
CGCTCTGTCA 
TCCCGGGTTC 
GGCTCATATT 
AACTCCTGAC 
CTGAGCCGCC 
TACGCTCAGA 
TGCACTGTAG 
TAACTAACTA 
TCACCCATAA 
CGCAGTGGCT 
GTGAAACCCC 
GTAACCCCAG 
CTGCAGTGAC 
CTCCAGATAA 
ACATACATAC 
GAAAATGAAA 
TCTCTGTTCG 
TCTCTCTGTC 
TCTGTCTTCT 
CTGTTTGTTT 
GTCTCTCTCT 
CTCTTTCTTT 
CTCTCTCTCT 
TCTCCCTCCC 
TCTCTCCGTC 
TCTCTCTGTC 
TCTCTCTCTC 
TATGTCTGTC 
CCTTCCACGA 
CAGGCGGGCC 



ACTTGAGGCC 
ACCGGTCAAG 
TCTTTTTGCT 
TTTTCTTTTT 
TTCCTCCCTT 
CTGGCGGGAG 
AGAGCGCCTT 
GGCAGTGGTG 
CTTCCACCTC 
CACCGTTTTT 
GCAGTGCGGT 
TCCTGCATCG 
TTCGTATTTT 
ACCTCAGGTG 
CGCGCCCGGC 
ACCCTCCTGC 
TCCGGGGAAT 

TTGTGTACTT 
GCGCATCGCT 
TTATTCCCTT 
GAAGCTTGAC 
TTGGGGCTGA 
CATCGCTTGC 
TTTCATTTCT 
AGGGGGTGCG 
TTTCACCACC 
T CAT C T AG AA 
GGGACGCGGA 
ACAAAACACG 
ACTGAACACC 
GACAAATAAA 
GACAGAGAGA 
TCAGGGGGTT 
CTTTTTTTTT 
CCCAGGCTGC 
CAGTGATTCT 
CCTATTTTCA 
CTCAAATGAT 
GGGATTTCAG 
ATGACGTGTC 
CCTGGGCAGC 
ACAAACTAAC 
GTGTGTGTTC 
CACGTCTGTC 
GTCTCTACTG 
CTACTCGGGA 
CCAAGATCGC 
ATACGTACAT 
ATCCATGCAT 
GAAAAGGCAC 
TCTCTGTCTT 
TCTGTCTCTG 
GTCTTACTCT 
CTCTCTCTCC 
GTCTGTCTAT 
TTCTGTGTCT 
CTCTGTGCCT 
TTTCTGTTTC 
TCTCTCTCTT 
TGTCTCTTTC 
TCCCTGTCTG 
TCTTTCTCTG 
AAGTGAGAAG 
TTGCTGGGCT 



CCACAGGTCG 
GAGATATGCC 
TCTCTTTTCT 
CTCTCTTCCC 
CCTCCCTTCC 
GCACCATGCC 
CCAGTGATCT 
CCGTCGTAAC 
AGCCTCCAGA 
AATTGTTTTT 
GGCGCGATCT 
GCCTCCTGAG 
TAGTGGAGAC 
ATCCGCCCTC 
CTTCATTTTT 
CCGGCGTCTC 
GACTCACGAC 
TCTTTCTTTC 
ATTTTCAGAC 
TTGGAAGCCG 
CCTGGAGTTG 
AGGCGCAGGG 
AAACTGGGTC 
TCGCCCTCTG 
TCCTTTCTTG 
GGGTGGGGAC 
ACCACCACCA 
GTGGGAACTT 
AGTCTGCTTG 
AATACTGTCG 
CCCGTCACAA 
AGTTAGGGGG 
GAGAGAGAGG 
TCTGGCCTTT 
CTTCTTCTTT 
GGTCGCGGTG 
TCTTCGGTAG 
GTAGAGACGG 
CCGCCTTCCT 
C CTTTAAAAG 
CTCTCTGCCG 
AAGAGCCAAA 
TAACTAACTA 
CCGTGAGAGT 
ATCCCGAGGT 
AAAATACGAA 
GGCTGGGGTG 
ACCACTGCAC 
AAATAAATAC 
ACAGATATAC 
TGTATTGCTA 
TCTCTCTGTG 
TCTTTGTCTC 
CTTTCTCTCC 
CTCCCTGTCT 
GTCTTTCTCT 
CTCTGTCGGT 
ATCTTCTGTC 
TCTCTCTCTC 
TCTGTCTGTT 
GTTCTCTCTG 
TCTGTTTCTC 
TCAGTCTGTC 
CGCGTGCTTC 
TCCCCACTCG 



AGGCTTCGGT 
CCTTCCCCGT 
TTCTTTCTTT 
CTCTTTCTTT 
TTCTTTCCTC 
TGCTTGGCCC 
CATTGACTGA 
TCACTCCCTG 
GTACAGAGCC 
TTTTCCCCCG 
TGGCTCACCG 
TAGCCGGGAT 
GGGGCTTCTC 
CCCGGCCTCC 
AAATGTTTTC 
AAAGTGCTGG 
CACCATCGCT 

GGAGTCTCGC 
CGGCAACGCC 
GAGCTGATGC 
CCACCCAGAG 
CCCTATTTTT 
AGATCCCCCG 
CGTTTGAGGA 
GGAGGGGAGC 
CCGAAGATGA 
ACAGATGACA 
AGGGAGGAGG 
GACACAGCAC 
GTTTACCTAT 
GAGAAGAGAG 
AGGGAGAGAG 
TGGGAGAACG 

GCGCTCTCTC 
CTGGGATTAC 
GGTTTCTCCA 
GGGCCTCCCA 
CGCGGCCCTG 
TAGGTTGACT 
CTCCGNNCCC 
AACTAACTAA 
GATTTCTAAG 
CAGGAGTTCG 
ATGGAGTCAG 
GAAGAATTGC 
TACAGCCTGG 
ACACATACAT 
AAGAAAGAAA 
CTGGGCTAGG 
TCTCTTTCTC 
TCTCTCTCCC 
CCGTCTGTCT 
GTTTCTCTCT 
GTCTGTCTCT 
CTCTCTCTCT 
TTACTCTCTT 
TCTCTCTCTC 
TCTCACTGTC 
TCTGTCTGTC 
TCTATCTCTC 
AGACACCCCC 
GGTGCTTAGA 
GTGTATGATT 



CGGCCGTGAC 
TTGCTTTTCT 
CTTTCTTTCT 
CCTGCCTTCC 
CCGCCTCAGC 
AAAGAGACCC 
TTTAGAGACG 
CAGCGTGGAC 
TGGGACCGCG 
AGACAGAGTT 
CAACCTCTGC 
TGCGGGCATG 
CATGTCGATC 
GGAAGTGCTG 
CCACAGACGG 
CGTGACGGGC 
CTACTGATCC 
TCTTTCTTGA 
TCTGGGCGGG 
TTTCAAAGCC 
CTTCCGTAGC 
GCTGGCTGCG 
GATACCTCAG 
CCTCCACCGC 
GGGGGTGCGG 
GTCCTAAGGG 
CAGCAAGGAT 
GTTCTTGCAT 
GGTGGAAGGA 
TGACTACCCG 
GTCACAATCT 
GAGAGAGAGA 
GAAAACGAAA 
TTCAGCGACA 
TTTTTGGACT 
GGCTCACTGA 
AGGCGCACAC 
CGTTGGCCAC 
AAGTGCTGGA 
CCACCTTTCG 
CCTTGAGTCC 
CCACCTCCTC 
CTAACTAAAA 
AAATGGTACT 
AGACCAGCCC 
GCGCCGTGGG 
TTGAACCTGG 
GCGACAGAGT 
ACATACATAC 
AAAAGAAAAG 
GCCTTCTCTC 
TGTCTGTCTG 
TCTCTGCCTG 
CTCTCTCTCT 
CTCTCTTTCT 
TTCTCTGTCT 
CTGTCTGTCT 
TCTCTGCCTG 
TCCCCCTCTC 
TCTCTCTGTC 
TCTCTCTCTC 
GCTGTCCATC 
GTGCCGGGTA 
GAGGCCGAGA 
TCGGGAGGTC 



1 9 0 3 0 
1 ( * i 4 0 
J o 2 0 0 
l 9 2*;. o 
1 9 3 2 0 
19 3b o 
1 94-1 0 

1 9 5 0 0 
19560 
1962 0 
19680 
1974 0 
19 8 0 0 
19860 
19920 
19 9 8 0 
20040 
20100 
20160 

2 0 2 2 0 
2 0 2 8 0 
20340 
2 0400 
20460 
20520 
20580 

:0040 
2 0 7 0 0 
20760 

::08;:o 

2088O 

2 0 94o 

210 0 1 

21060 

21120 

21180 

21240 

21300 

21360 

21420 

21480 

21540 

21600 

21660 

21720 

21780 

21840 

21900 

21960 

22020 

22080 

22140 

2 2 2 0 0 

22260 

22320 

22380 

22440 

22500 

22560 

22 62 0 

22680 

22740 

22800 

22860 

22 92 0 
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GAGGCCGGGT 
GGCGTCCGTA 
TAAGGGTCGA 
CGGCTAAATA 
GGGCAGAAGG 
GAGACAGCTT 
GGTGATGAAA 
TGCTCATGTA 
GAGAGACGGG 
AGAGAGAGAG 
TTCTGGCCTT 
TTCTTCTTCT 
GCGGTGCGGT 
TTCTTCGGTA 
AGTAGAGACG 
TCCACCTTCC 
GCCTTTAAAA 
GTCCTCTCTG 
AGCAAGAGCC 
AACTAACTAA 
AAGAAATGGT 
CGAGACCAGC 
CAGGCGCCGT 
TGGTTGAACC 
TGGGCGACAG 
CATACATACA 
AAAAGAAAGA 
TGTCTGTTTC 
TCTGTCTGTC 
TCTCTGCCCT 
TCTCTCTCAC 
TCTCTGTCTG 
TCTTTTTCTG 
CTCTTTCTCT 
CTCTCTCTCT 
TTTCTCACTG 
TCTCTCTCTG 
CCGTCTGTCT 
CTCTCTCTCT 
CTGCCTCTCT 
TTACTCTCTT 
TGTTTCTCTC 
TGTCTCTCTC 
GTCTGTCTCT 
GTCTGTCTTC 
TCTCTCCCTA 
TTTCTCTCTC 
TCTCTGTCTC 
TCTTCTGTCT 
CTCTCTCTCT 
GTCTGCCTCT 
ATCTTCTGTC 
TCTCTCTCTC 
TTTCTCTCTG 
TCTCTGTCTC 
TGTGTGTCTG 
CCCTCTCTCT 
CTCTCTCTGT 
CTGTCTCTCT 
TGGCTGTCTG 
TCTGTTTCTC 
TCTCTTTCTT 
TCTTCTGTCT 
GTCCCTCCCT 
CTGTCTCTTT 



jCTTGG 



CT PC rCCTAT 
TTTAGTGTCA 
CCGCGTGTTC 
AGGGGGACCG 
CAGGAAGAAA 
TCATCTGCAC 
TGCTTGAACG 
GAGAGAGGGG 
AGAAAGAGAA 
TTGGGAGAAC 
TTTCTTTCTT 
GGCGCTCTCT 
GCTGGGATTA 
GGGTTTCTCC 
TGGGCCTCCC 
GCGCGCGGCC 
CCATAGGTTG 
AAACTCCGTC 
AATCTCTACA 
ACTGTACACT 
CCGGCCCACG 
GGGGCAGGCA 
TGGCAGGCGG 
AGTGAGACCC 
TACATACAAC 
GAAAATGAAA 
TCTCTGTTCG 
TGTCTGTCTC 
GTCTCACTGT 
TCCCTCCCTG 
CCTCTCTCTT 
TGTCTCTCTG 
GCCTGTCTGT 
CTCTNNNCCC 
TCTCTCTCTG 
TCTCTCCCTC 
GTCTGTCTCT 
CTCTCTCTCT 
CTTTCTCTTT 
TCTCTGCCTG 
TCTCTCCCTC 
TGTCCGTCTC 
GTCTCTGTCT 
TGTCTTACTC 
CCTTTCTGTT 
TGTCTTTCTC 
TGTCTCTCTC 
TATTCTCTTT 
CTCTCTCTTT 
CTCTTTCTTT 
TTACTCTGTT 
TCTCTCTCCC 
TCTCTCTGTC 
TGCCTCTCTC 
TCTTCTGTCT 
CCCTCCCTTT 
CTGTCTCTTT 
CTGTGTCTGT 
CCTGTCTCTC 
TCTCTGCCTC 
TTTCTCTGTC 
TACTCTCTTT 
CCCTGTCTGT 
CTCTTTCTCT 



ATGCGAGGGG 
TTCCCCGATA 
TGCCTCTTTC 
TCATCTAGAA 
GGNACGCGGA 
ACAAAACACG 
ACTGAACACC 
ACAAATAAAA 
GGAGAGGGGG 
GTAAAACCAA 
GTTCAGCGAC 
TTTTTTTGGA 
CGGCTCACTG 
CAGGTGCGCA 
ACGTTGGCCA 
AAAGTGCTGG 
CTGCCACCTT 
ACTCCTTGAG 
CCCCCACCTC 
CGTCACCCAT 
GAACGCAGGC 
TGGTGAAACC 
CCTGTAACCC 
AGGCTGCAGT 
GGTCTCCAGA 
ATACATACAT 
GAAAAGGCAC 
TCTCTGTCTT 
TTTCTTTCTT 
GTCTGTCTTC 
TCTGTTTCTC 
TCTCTATCTG 
TCTGTCTCTC 
CTGTCTCTCT 
TCCCTGTCTG 
TCTGTCTGTT 
TCTGTGTGTA 
CTCTCTCCCT 
CTGTCTCTGT 
CTGTGTCTCT 
TCTATCTGTC 
TCTCGCTCTC 
TGTCTTTTTC 
CTGTCTCTCT 
TCCTTCTCTG 
TCTCTCTCGC 
TGTCTGTCTC 
TCTCTCTCTC 
CTCTCTCTGT 
CTGCCTGTTT 
TTCTGCGTCT 
TCCTTGCCTG 
TCCCTTTCTC 
CATCTCTGTC 
TCTCTCTCTC 
TACTCTCCTT 
CTGTTTCTCT 
CTCTGTCTGT 
CTCTCTGTCT 
TCTCTCTCTC 
TCTCTCTCTC 
TCTCTGTCTC 
CTCTGGCTGT 
CTGTTTCTCT 
CTCTCTGTCT 



CATTTTCAGA 
AGCTCCTCGA 
ACCGCCACCA 
GTGGGAACTT 
AGCCTGCTTG 
AATACTGTCG 
CCCGTCACAA 
GTTCGGGGGG 
GGGGAGAGAG 
CCACCACCTC 
AATGCAGTAT 
CTGAGTCTCT 
AAACCTCTGC 
CCATGACGGC 
CGCTGGTCTC 
AAACGACAGG 
TCGCTGCGGC 
TCCCCTAGGC 
CCCGCGCACA 
AAGTGTGTGT 
TTCACGTCTG 
CCCGTCTCTA 
CAGCTACTCG 
GACCCAAGAT 
TAAATACGTA 
ACAGATATAC 
TGTATTGCTA 
TCTCTCTGTG 
TCTGTCTCTG 
TATCTTACTC 
TCTCTCTCTC 
TCTCTTTCTC 
TCTCTCTCTG 
CTGTCTCTCC 
TTTCTCTCTG 
TCATTCTCTC 
TCTTTTGTCT 
GTCCCTCTCT 
CTTTCTCTGT 
CTGTCTCTCT 
TGTCTCTCTC 
TCTGTCTTTC 
TGTCTGTCTG 
CTCTCTCTCT 
CCTGTCCATC 
TAGCTCTCTC 
TTTCTCTGTC 
TCTCTCTCTC 
CTCTCTCTCT 
CTCTCTGTCT 
CTCTGTCTCT 
CCTGCCTGTC 
TTTCTCTGTC 
TTTCTATGTC 
TCTCTCTCTC 
CTCTGCCTGT 
CTCTCTCTCT 
CTGTCTCTCT 
GTGCCTATCT 
TGTCTGTCTC 
TGTCTGTCTC 
TCTCTGTGTC 
CTGCCTGTCT 
CTCTGTCTCT 
CTGTCTCTCT 



CTTTTCTCTC 
CTTCAACATA 
CCGAAGATGA 
ACAGATGACA 
AGGGRGGAGG 
GACACAGCAC 
GTTTACCTAT 
GAGAAGAGAG 
AGAGAGAGAG 
CTTGACCTGA 
TTGGGCCCGT 
CTCGCTCTGT 
TTCCCGGGTT 
CGGCTCATCG 
GAACTCCTGA 
CCTGAGCCGC 
CCTTACGCTC 
CATTGCACTG 
TAATAACTAA 
TCCCGTGAGG 
TCATCCCGAG 
CTGAAAATAC 
GGAGGCTGGG 
CGCACCACTG 
CATAAATAAA 
AAGAAAGAAA 
CTGGGCTAGG 
TCTCTTTCTC 
TCTTTGTCCC 
TCTTTCTCTC 
TTTCTGTCTG 
TGTCTGTCTG 
TGCCTATCTT 
CTCCCTTTCT 
TCTCCCTCTC 
TGTCTCTGTC 
TACTCTCCTT 
CTTTCTGTCT 
CTGTCCCTTT 
CTCTGTGCCT 
TGTCTCTCTC 
TCTCTTTCTC 
TCTCTCTCTT 
CTCCTTGTCT 
TGTCTGTCTG 
TCTCTCTGCC 
TGTCTGTCTC 
TGCCTCTCTC 
CTCTCCTTTA 
GTCTCTGTCT 
CTCTCTCTCT 
TGTGTGTCTG 
TCTCTCTCTC 
TGTCTCTCTC 
TCTGTCTGTC 
CCGTCTGTCT 
TTCTGTCTGT 
CTTTCTTTTT 
TCTGTCTTAC 
CGTCCCTCTC 
TTTCTCTGTC 
TGTCTCTCTT 
CTCTCTCTCT 
GTCTCTCTGT 
CTCTCTCTGC 



GGTCACGTGT 
AACGGCGTCC 
AAGCAAAGAT 
GTTCTTGCAT 
GGYGGAAGGA 
TGACTACCCG 
GTCACAGTCT 
GAGAGAGAGA 
AGAGAGAGAG 
GTCAGGGGGT 
TCTTTTTTTC 
CACCCAGGCT 
CCAGTGATTC 
TTCTATTTTT 
CCACAAATGA 
CGGGATTTCA 
AGAATGACGT 
TAGCCTGGGC 
CTAACTAACT 
AGTGATTTCT 
GTCAGGAGTT 
GAAATGGAGT 
GTGGAAGAAT 
CACTACAGCC 
TACACACATA 
AAAAGAAAAG 
GCCTTCTCTC 
TGTCTGTCTG 
TCTCTCTCCC 
CCCGTCTGTC 
TTTCTGTCTC 
CCCCTCTCTT 
CTGTCTTACT 
GCTTCTCTCT 
TTTCTGTCTG 
TCTGTCTCTC 
CTCTGCCTGT 
GTTTCTCTCT 
CTCTGTCTGT 
ATCTTCTGTC 
CCTGCCTTTC 
TCTGTTTCTC 
TCTTTCTGTC 
CTCTCACTGT 
TCTCTCTCTC 
TGTTTCTCTC 
TTTCTCTCTG 
ACTGTGTCTG 
CTGTCTGTTT 
TTCTCTGTCT 
CTCTGTTCCT 
TCTCTCTCTC 
TTTCTGGGTG 
TTTCTCTCTG 
TCTCTCACTG 
GTCTGTCTCT 
TTCTCTCTTT 
CTCTGTCTCT 
TCTCTTTCTC 
TCCCTGTCTG 
TGTCTGTCTC 
TCTGTGCCTA 
GCCTGTCTCC 
CCATCTCTGT 
CTGTCTCTCT 



.:2 9 SO 
2 3 040 
3 1 :)0 

2 3 1 6 0 

3 3 2 2 0 
2 3 2 8 0 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
2472 0 
2 4780 
2484 
.490' 

2496 ) 

25020 

2 508 0 

25140 

2520") 

25260 

25320 

25380 

2 5440 

25500 

25560 

2562U 

25680 

25740 

25800 

25860 

25920 

25980 

26 040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 
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CACTGTGTC 
CTCTCCCT' 
GTCTGTTT 1 
CTCTCTCTO 
CCATGTCTC 
GTTTCTTTG' 
CTCTCTGTT' 
CTCTGTCTCT 
GTCTTTCTAT 
CCTGCCCTTC 
CTAGACAGGC 
CGGGTCCCCG 
CCGTACTTGT 
GACGCCAACA 
GGGCAGGCCC 
GGAAGCGGAG 
TGAGTGAGAC 
TGCTGACGGA 
TTATTGAAAG 
ACGGGTTTCT 
CTCGCCTAGG 
CTTTGTGTTT 
CAAGTTGCGC 
TTAGGTGGGT 
CCTTCCAGAG 
CGATCTCATT 
CACCGTTTTT 
GGCATAGCCC 
CCCGCTGCTT 
TCACCTTTTA 
GAAGGGCGCC 
CTGAAAACTA 
GTTGGCTTGT 
TTTCTTTTCA 
CACGTTAGCT 
CAGTGAGAGC 
GTAGGCGAAT 
TGTTGTCGTT 
TTTATGGGAT 
CCCCTCTCTC 
CTCTCTCTCT 
CTCTCTCTCT 
TGTCTGTCTG 
TCTCTCTCTC 
TTTCCTTCTC 
TCTCTGCCTG 
TCTCTCTGTC 
TGTCTTTCCT 
CTCTCTCTCT 
CTCTCTCTCT 
CTCTCTGTGT 
TCTCTGTCTG 
CTCTCTTTCT 
CTCTGTCTGT 
GTGTATGTGT 
TGTCTGTCTG 
TTTCTCTCTC 
TTTATCTGTC 
TCTCTGTGTA 
TCTCTCTCTC 
TCCCGCCCTC 
GGCCTGAATT 
ACTGGCGAGT 
AATCCTAAAA 
CCTAGGATGC 



GTCTTCTGTC 
ATGTCTCTCT 
CTCTCTGTGT 
GTCTCTCTCT 
CTCTCTCCCT 
CTGTCTGTCT 
GTCTTTCTCC 
CTCTCTTTCT 
GTCTGTCTCT 
CACGAGAGTG 
GGGCCTTGCT 
CTTGGATGCG 
CCTATTTCCC 
CGGCGAAACC 
TGTAATGCCA 
GCTGCAGGGA 
TCGGTCTCTA 
CATTTGCAGG 
TCGACGTTGA 
CTCTCTCCCT 
GAACCTCCGC 
TGGCGCCTAG 
CCCGGCTCCC 
TTCCCCCAAA 
CCACCCCGGT 
CTTGCCAGGC 
GAAGATGGGG 
CTTGACCCGC 
CCCGCTCAGG 
TCACGATGTT 
ACGGCTCTAG 
ATAACTTTNC 
TTTGTTTCGT 
GGTGAAGTAG 
GCCGTTTTTT 
CGGTTGATGT 
GCTGCTGCTG 
GTCGTTGTTG 
CAAAAGCATT 
TGTCTCTCTG 
CTGTGTCTCT 
CTCTGCCTGT 
TCGGTCTCTC 
TCTTTCTGTT 
TCTGTCTCTC 
CCTGTCTCTC 
TCTCTGTCCG 
TCTCTCTGTC 
CTCCCTGTCT 
CTCTCTCTCT 
GTCTGTCTTC 
TCTCTCTCTC 
GTCTGTTTCT 
CTCTGTCTCT 
GTGTGTGTGT 
CCTGTCTGTT 
TTTCTGTTTC 
TCTCTCCGTC 
TCGTTGTCTC 
TCTCTGTCTG 
TCTTTTTTTG 
CTTCACTTCT 
TGATTTCTGG 
TCTGGGGTGG 
CGGAAGAGTT 



TTACTCTCTT 
CTCTCTCTCA 
GTCTCTCTCC 
CTCTTTCTGT 
CTCACTCACT 
GTCTGTCTGT 
CTCCCTGTCT 
CTTTCTGTCT 
TTCTCTGTCA 
AGAAGCGCGT 
GGGCTTCCCC 
AGGGGCATTT 
CGATAAGTCT 
CCGTCTCTAC 
GCTCCTCGGG 
GCCGAGATCG 
AATAAATACG 
CAGGCATCGG 
CACGGAGGGA 
TCTGGAGGCC 
CCTGGGGGCC 
ACTCTTCTAC 
CCCACTACCC 
CCGCCCCCCC 
GTGCCTCCGT 
TGACATTTGC 
GCGGCACGGT 
GTGGGCAAGC 
CCTCCCTCCC 
TTAGTTTCTC 
TCTGGGCCTT 
TCACTTAAGA 
TCTGTTTTGT 
AAATCCCCAG 
CCTGTTGTGA 
TTACNATCCT 
CTCTTGTTGC 
TTGTCGTTGT 
ATAAAATATG 
TCTGTCTCTG 
CTCTCTCTGC 
CTCTCTCACT 
TCTCTCTCTC 
TCTCTCTCTC 
TCTGCCTGTC 
TCACTCTCTC 
TCTCTGTCTT 
TCTGTCTCTC 
GTCTGTCTCT 
CTCTGTCTTT 
TGTCTTACTG 
TCTCTCCCCC 
CTCTGTCTGT 
CTCTCTGTCT 
GTGTGTGTGT 
TGTCTCTCTC 
TCTCTGTCTC 
TGTCTCTTTA 
TCTCTGTCTG 
TCTGTCCGTC 
CAAAAGAAGC 
GACATCCCAG 
ACTTGGATAC 
CTTCTCCCTC 
TTCTCAATGT 



TCTCTTGCCT 
CTCACTCTCT 
CTCCATGTCT 
CTGTTTCTCT 
CTCTCTCCCT 
CTGTCTCTCT 
GTCTGTCTGT 
GTTTCTCTCT 
GTCTGTCAGA 
GCTTCGGTGC 
ACTCGGTGTA 
TCAGACTTTT 
CCTCGACTTC 
TAAAAATACA 
AGGCTGAGGC 
CGCCACTGCA 
GAAATTAATT 
TTGTCTTCGG 
GGTCTCGCCG 
CCTCCCTCTC 
CTATTGTTCT 
TTGGGCTTTG 
ACGTCCCTTC 
CCCCCCGCCT 
CTTCTCTCCC 
ATCGGTGGGC 
CCCACTTCCC 
GGGCGGGTCT 
TAGGAAAGCT 
CGCCCTCCGG 
CTCAGTACTT 
TTTCCAGGGA 
TTTGTTCGTG 
TTTTCAGGAA 
ACTAGCGCTT 
TCATCATGAC 
TGTTGTTGTT 
CGTTGTTTTC 
TGTGATTATT 
TCTCTCTCTT 
CTGTCTGTTT 
GTGTCTGTCT 
TCCCTGTCTG 
CGTCTCTGTC 
TCTCTCACTC 
TCTCTGTGTG 
TCTCTGTCTG 
TCACTGTGTC 
CTCTCTCTCT 
GTCTTTCTTT 
TCTTTCTCTG 
TGTCGGCTGT 
CTTTCTCTCT 
CTCTCTCTCT 
CTGCCTTCTG 
TCTCTGCCTG 
TGTCCATCTC 
TCTGTCTCTC 
TCTCTGTCTC 
TGTCTGTCTC 
TCAAGTACAT 
ATTTGATCTC 
CTCATAGAAA 
GACTGTCTCG 
GCATCTGCCC 



jCCTCTCTGT 

:tccgtctct 

CTCTCTCTCT 

:tctgtctgt 

CTCTCTCTCT 
CTCTCTCTCT 
CTCTCTCTCT 
ATCTCTCGCT 
CACACCCGTG 
TTAGAGAGGC 
CGATTTCGGG 
CTCTCGGTCA 
AACATAAACT 
AAGCTGAGTC 
GGGAGAATCG 
CTACGGCCCA 
AATTCATTAA 
GCATCACCTA 
ACTTCACCGA 
TCCCTCGTTG 
TTGATCGGCG 
GGAAGGGTCA 
ACCTTAATTT 
CCCAACACCC 
CTTCCCCCAC 
GTCAGGCCTC 
CGGAGGCAGC 
GCAGTTGTGA 
TCACCCTGGC 
CCAGCAGAGT 
GCCCAAAATA 
CGGCGCCTTG 
TTTTTCCTTT 
GACGTCTATT 
TTGTGACTCT 
ATCTTATTTT 
GTTGTTGTTG 
AAAGTATACC 
TCTTGAGCAC 
TCTCTGTCTG 
CTCTCTCTCT 
TCTGTCTTAC 
TATGTTTCTC 
TTTCTCTGAC 
TGTCTTCTGT 
TCTCTCTCTC 
TCTCTTTGTC 
TGTCTTCTGT 
CCCCCTGTCT 
CTGTCTCTGT 
CCTGTCTGTC 
TTCTCTGTCT 
CTGTCTCTTT 
GTGGGGGTGT 
TCTTACTCTC 
TCTCTCTCCC 
TGTCTTTCTC 
TCTCTCTTTC 
TGTCTCTCTG 
GGTCTCTGCG 
CTAATCTAAT 
CCTACAGAAT 
CTACATATGA 
AAAAATCGTA 
GTGTCCTAAG 
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304*J0 


CCCTTACCAA 


30480 


GCTGTACAGA 


30540 


ATAAAGATCC 


3 0 6 0 0 


CCTCTGTTCC 


30660 


TGATCTGTGA 


30720 
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AC?JpcSci" ?rr'^TT^^ CTCAAATAT-j TACGTGCAAA CACTTCTCTC CATTTPCA-A ,^ 80 
g£?tSto- G?kn"L^r " ^ CCACTG '^ CTCTTTGAAA AAAATCCCAG AAGTGGTTT? -^J? 



■J '0 0 
0'.*6 0 
] U 2 0 

i / « o n 



GGPTTTTTPP prAP,^^/-^ m „ rt^^i^^^A^ AAGTGGTTTT 

mm pi siiiin 

ieiiil 

illlli 

ss = s— ~1 iiii ™- 
ssS s™ i™ ™" =s = sis 

TGTAATAAGT 2S?£5£ SSSSJJIS AAATGATAGA ACTCCACTGC I,:.,,. 

SS sssssj SSSSSS? sssss 

■■■■■E ... 

ATTTCCTATG TGCCTACTTA TACATrarTA ^aa*™^™ ACAATA TGAC CTAGGTAGTA i3120 
TAAAGGGTAC GTgSSSS SagS SJfESS^ ACTGCTAAAT 33180 

SESSS SSESEf SSSSSS SS SSSSSSg 

s=lisltlfs= .... 

Ill II tiJIll tills s 

SS£ SHE IS HI S5S "sS 

SSSSKJ SST ™™ SS3S3? SSSSSS 

iiE « iiE iiii ssssss ssss 

GGACTGGAGT TACCTTTCGG cS^CC^ TCTCCC^ AGAACTTTGT GTTTTTGCAA 

L1LU1LCCC TCTGCGAGAA GACAGACGGT GTTCCGGTTT 



■ i :: h o 
"U3 20 
1 3 8 o 
■32440 

i s o o 

3 1 560 

3 1620 

316 8 0 

31740 

31800 

31860 

31920 

31980 

3 2 040 

3 210 0 

32160 

3 2 2 2 0 

32280 

32340 

32400 



3264 0 

32760 
32820 
^2880 
32 940 
33000 



33240 
3 3 3 0 0 
33360 
3 3 4 2 0 
3 3 4 8 0 



33720 
3 3 7 8 0 
5 3 840 
3 3 9 0 0 
33960 



3 4200 
34260 
34320 
34380 
34440 
34500 
34560 
34620 
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GGCCGATTCT 
ACGGTGTCTC 
TAATGGCAAA 
AGTTCGACCA 
CAGTGGGACA 
ATTCAAGGAG 
CTGGGGTGTT 
AAGTGTTGTG 
AAATGTTTGG 
CAGAGGAAAG 
TCTGTCTTGC 
TCAGGAACGA 
TGATCCCCTT 
TGCGGGGTAG 
CGGTAAGAAC 
CGTTCTGCGG 
GCCTGGACGG 
CACAGGCACC 
GCACCAGCCG 
CTGACTCTGG 
AGAACACGGA 
AAGGACTCTC 
GTGCGGACCG 
CTTGGCAGAT 
CGGATGCCCC 
CGCGCGGCCA 
AGAGTCTCTC 
GTGAGACCAG 
CTTTGGGAGG 
GTTGCAGTGA 
AGGCCCTGCC 
TCTTCTCAGG 
CCTTCGGACG 
CACCGCAACC 
GGGATTACAG 
TTCTCCATGT 
CCTCCCAAAG 
ATTTATTTTT 
TACATTTATT 
TCTGCTGCCC 
GGGTTCACGC 
CCGTGCCCGG 
ATGGTCTCGA 
ACAGGCGTGA 
TGAAACCAGT 
CCGTGTTGCC 
GATGACAGGC 
CCCGACAGAC 
AAATTTGGGA 
CTATCACAGG 
TGTGATACAG 
TTAGCCCTGG 
CAGTCGGTCT 
CTGCGTCCCC 
ACCCACTGTG 
TTCATTCCGG 
CTCTGAAAAC 
GCAACTGTGT 
CTAGGAAATC 
AAACAGATAA 
ACCCATTACA 
ATACAATAGG 
ATACAATACA 
GATGCCGAGG 
AGAAATCCCG 



GGCAACAGGC 
ATACCAGTGC 
ATGCCAACAC 
GTTTTTGGTT 
GTATCAGGTG 
TACTTTGAAT 
TCGGTAAGAA 
GCGCAAGACT 
TCAGGATCTG 
GATTTCAACG 
AATATACATG 
TAATATGATC 
CTGCAAGCTT 
TACAACTGTG 
AGGTCGCCAG 
GGGAAACCGC 
GTCCCTGCAG 
CCCCCCCCCC 
CCGTTACCGG 
AAAGCCCGGC 
TCCACTCCCA 
TTCCTTCTGA 
CTGACTCCCT 
TCTGGCCCTT 
GAGTGACTGT 
CCTGAAACCA 
GCCGACTCTC 
ATGTATTAAC 
CCGAGGCCGT 
GTGAGCCATA 
ACAGGCAGGC 
GTAGGAAGCA 
GAGTTTCACT 
TCCACCTCCC 
GGAGGAGCCA 
GGGTCAGGCT 
TGCTGGGGTG 
TTAATTATTT 
TATTTATTTA 
AGGCTGGAGT 
CATTCTC CTG 
CTAACTTTTT 
TCTCCTGACC 
GCCACCGGCC 
TAGTTTTTGT 
AAGGCTTGGA 
GCGAGCCTAC 
AGTTTCACGG 
CGTCAGCTTC 
AC CGTAC ACG 
AAAT AC AC GG 
TGTCCGTGCC 
CTCGTCTCTG 
CCAGGAGCCC 
CTGTGGGAGC 
GCTGACACGC 
GGAGGCCTCA 
CTTCTCCACC 
GCCACTTTGA 
ATAAATAAAA 
ATACAATAAG 
ATACGATACA 
ATACAATACG 
TGGACGCATC 
TCTCAATTGA 



TTTTCTGAAG 
AGTTTTGTCA 
TTGGGGTTAA 
TATGTTGAAC 
AAAGGACAGC 
TGGAAGATTC 
CGGTCTCGGG 
TGGACAGGCA 
GCTTTTCCCC 
AAGGCTCTTT 
TCCCGACGAT 
GTGGCTTTTC 
GCTGAGATCA 
TCCTTTCAAG 
TAAGAACAAG 
ATCTCGGTAG 
CACCGCCATC 
CATAGCGGCT 
GGGATGGGGG 
GCCTTGTGAT 
AGTTCAGTGG 
TTCGGTCTGC 
CTACCTTGGG 
TCTGGCCCTT 
GGCTCGCACC 
CAGGAGCTCG 
TCTTGACTTG 
TCAGGCCGGG 
AGGATCCCTC 
GTTGTGTCAC 
AGGCAGGCAG 
AAAATAACAG 
CTTGGTGCCC 
GCGTTCAAGC 
CCACACCCAG 
GGTCTCGAAC 
ACAGGCGTGA 
TACTTTTTTT 
TTTATTTACT 
GCAGCGGCGT 
CCTCAGCCTC 
GTATTTTGAG 
CCGTGATCCG 
CCGGCCTATT 
AATTTTTTTT 
CCGAGGGATC 
CGCGCCCGGA 
CAGAGCGTTT 
TGGCCTCACG 
TAAGGAGGAG 
ATTCACCCAA 
AGTGATTCTT 
GACGGAAGTT 
TGGTCGATTA 
CTCCATCCTT 
TCACTGGCAG 
CAGAGGAAGG 
GCCCCCGCCC 
CGACCGGGTC 
TAACACAAAA 
ATACGATACG 
ATACAATACA 
CCGGGCGCGG 
ACCTGAAGTC 
AAATACAAAA 



GGGCTCCGGT 
ATAGGGTCCG 
TGGACTAACA 
CTGTTTAGAT 
TGAATCGATA 
TAAATTCCAT 
CTGTCTGTGA 
GTTGCTAAAG 
CCTATTTCAC 
TGGTCACATT 
GG AAGGGGAA 
TGCTTATGAA 
ACACAACATT 
AGTCTATATG 
GCTTCTTCTG 
GC AT AGTGGT 
CTCGAGGCTC 
CCGGCCCGGC 
AGTCCGAGAC 
CCATTGCAAA 
GGGGATGTGA 
ACAGTGGGGC 
TTCCCTCGGC 
CAGTCGCTGT 
TCTCCGGAAA 
GGACACACGT 
AGTTCTTCGT 
TGCTGGTGGC 
GAGGAATCGC 
TGTGCTCCAG 
GCAGGCAGAA 
AATACAGCAC 
ACGCTGGAGT 
GATTCTCCTG 
CTGATTTTGT 
TGGCGACCCC 
GCCATCGTGA 
TAGTTTTCCA 
TATTTATTTA 
GATCTCGGCT 
CCAAGTAGCT 
TAGAGATGGG 
TCCACCTCGG 
TATCTATTTA 

CACCGGCCCT 
CCCCCCCTTT 
GGCTGGCGTG 
GACTCTGAGC 
AAAAATCGTA 
AACACAGAAA 
TTCGGTTTGG 
CCAGATGATC 
GTTGTGGGGA 
CCCCCCACCC 
GCGTCGGGCA 
GAGCACCAGG 
CCACCTCCAA 
TGATTGACCT 
GTAACTAACT 
ATAGGATGCG 
ATACAATACA 
TGGCTCATGC 
GGGAGTTGGA 
CTAGCCGGGC 



GGATGGCACG 
TCTCCGGGAC 
GCTGCTGGTC 
CATATGGAAG 
GAAGACACTG 
CCGTTTCATT 
CATAAACTAG 
C T C T C TAG AG 
AT CAT GAT TC 
CTGATCCTTT 
AGCGAGCTGA 
ACACTCCACC 
TCGCAAGCAG 
TTTTATAGGC 
AGTGTACTTC 
TTAGTGCTTG 
AGGCCCACTT 
CAGCCCCGGC 
AGAATGACTT 
CCGAGAGTCA 
GGGGTGTGGC 
CTAGGGCTGG 
CCCACCCTGG 
CAGAAACCCC 
CATTGGAAAT 
GCTTTCGGGA 
GGGTGCGTGG 
TCACGCCTGT 
CTAACCCTGG 
TCTGGGCGAA 
AGACAACAGC 
TTAATTAATT 
GCAGTGGCAC 
CCTCAGCCTC 
ATTGTTAGTA 
AGTGGATCTG 
CTGGCCGGCT 
TTTTAATCTA 
TTTTCGAGAC 
CACTGCAACG 
GGGACTACAG 
GTTTCACTGT 
CCTCCCAAAG 
TTAACTTTGA 
TTTTTTGAGA 
CGGCCTCCCA 
CCCCTTCCCC 
CTTAAACTCA 
CGAGGAGTCC 
ACGTTCAAAG 
C C AGTCTTTT 
ACCTTGACTG 
CGATGGGTGG 
TCGCCTTGGA 
CCTCCCCAGG 
TCACCTAGCG 
CCGCCTGCGC 
GTTCCTCCCT 
TTGATCAGGC 
AAATAAAATA 
ATAGGATACG 
ATACAATACA 
CTGTCATCCC 
GACAAGCCCG 
GCGGTGGCAC 



TCAGTGACAG 
TTGGGGTTTO 
CTCCTAATAA 
TTCCTGTTCC 
GGGAGTCTGT 
CGACGGTGTC 
GACGAGGTCC 
AGGTGAATCA 
AAAGGGACAC 
GGTAAGCCGA 
ATCACCAAAC 
CGATAAGATT 
GCATTTGCAT 
CTTTCCTGAG 
TGCATAAAGG 
CCATATAGCA 
TCTGCAGTGC 
TCATTTAAAG 
CTTTATCCTG 
CCTCGTGTTT 
AGGTAGGACG 
AGCTCTCTCC 
AACGC CGGGC 
ATCTCATGCT 
CTCTCCTCTA 
GAGAATGCTG 
TTAAGACGTA 
AACCCCAACA 
GGAGGTTGAG 
AGACAGAATG 
TGTATTATGT 

CATCTCGGCT 
CTGAGTAGCT 
GAGACGGCAT 
CCCGCCCCGG 
ACGTTTATTT 
TTTATTTATT 
AGACTCTCGC 
TCCGCCTCCC 
GCGCCCGCCA 
GGTAGCCAGG 
TGCTGGGATG 
GTCCAGGTTA 
CGAGGTTTCA 
AAAGTGCGGG 
CGCTTGTCTT 
TTCTAAATAG 
CCTGGTCTGT 
TCAGTCATTT 
AGAAATGGCC 
AG AGG AT T C C 
GGGACTTAGG 
GGGCGCGGTG 
GGGATCCCAA 
GTCACTGTTA 
ACAGCCTGGG 
CCCTTGTTGC 
AAAAACGAAC 
AGTCAATACA 
ATAGGATACA 
ATACAATACA 
GTCACTTTGG 
ACCAACATGG 
ATGCCTATAA 



3 4 <> 8 0 
3 4 74d 
34 HO 0 
34Hf,0 
3 4 '.On 
3 4 ' * H 0 
3 5 (.)4 0 
3 SI no 
3 51 <-■ 0 

35::::: o 
35:: so 

3 534 0 
3 54 0 0 
3 5460 
35520 
35580 
35640 
35700 
35760 
35320 
35880 
3 5 940 
3 6 0 0 0 
3 6 0 6 0 
3 612 0 
36180 
36240 
3 6 3 0 0 
36360 
36420 
3 6480 
3 6 540 
t - . ■ / 
3 6 o ' . 0 
36720 
36780 
3 6 840 
36900 
3 6 960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
3 8 040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
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tcccagctgc taggaaggct gaggcaggag aatcgcttga acctgggaag cggaggttgc 3£5«o 

agtgagccga gattgcgcca tcgcactcca gtctgagcaa caagagcgaa actccgtctc 3 864 0 

aaaaataaat acataaataa atacatacat acatacatac atacatacat acatacatac :ie-7oc 

ATAAATTAAA ATAAATAAAT AAAATAAAAT AAATAAATGG GCCCTGCGCG GTGGCTCAAG 3 87t,0 

CCTGTCATCC CCTCACTTTG GGAGGCCAAG GCCGGTGGAT CAAGAGGCGG TCAGACCAAC 3 8 820 

AGGGCCAGTA TGGTGAAACC CCGTCTCTAC TCACAATACA CAACATTAGC CGGGCGCTGT 3 8880 

GCTGTGCTGT ACTGTCTGTA ATCCCAGCTA CTCGGGAGGC CGAGCTGAGG CAGGAGAATC 3 8 940 

GCTTGAACCT GGGAGGCGGA GGTTGCAGTG AGCCGAGATC GCGCCACTGC AACCCAGCCT 3 90(>0 

GGGCGACAGA GCGAGACTCC GTCTCCAAAA AATGAAAATG AAAATGAAAC GCAACAAAAT 3 9 060 

AATTAAAAAG TGAGTTTCTG GGGAAAAAGA AGAAAAGAAA AAAGAAAAAA ACAACAAAAC 3 912 0 

AGAACAACCC CACCGTGACA TACACGTACG CTTCTCGCCT TTCGAGGCCT CAAACACGTT 3 918 0 

AGGAATTATG CGTGATTTCT TTTTTTAACT TCATTTTATG TTATTATCAT GATTGATGTT 3 924 0 

TCGAGACGGA GTCTCGGAGG CCCGCCCTCC CTGGTTGCCC AGACAACCCC GGGAGACAGA 3 9300 

CCCTGGCTGG GCCCGATTGT TCTTCTCCTT GGTCAGGGGT TTCCTTGTCT TTCTTCGTGT 3 9360 

CTTTAACCCG CGTGGACTCT TCCGCCTCGG GTTTGACAGA TGGCAGCTCC ACTTTAGGCC 3 942 0 

TTGTTGTTGT TGGGGACTTT CCTGATTCTC CCCAGATGTA GTGAAAGCAG GTAGATTGCC 3 9480 

TTGCCTGGCC TTGCCTGGCC TTGCCTTTTC TTTCTTTCTT TCTTTCTTTA TTACTTTCTC 3 9S40 

TTTTTCTTCT TCTTCTTCTT CTTTTTTTTG AGACAGAGTT TCACTCTTGT TGCCCAGGCT 3 960 0 

AGAGGGCAAT GGCGCGATCT CGGCTCACCG CACCCTCCGC CTCCCAGGTT CAAGCGATTC 3 966 0 

TCCTGCCTCA GCCTCCTGAT TAGCTGGGAT TACAGGCATG GGCCACCGTG CTGGCTGATG 3 972 0 

TTTGTACTTT TAGTAGAGAC GGTGTTTTTC CATGTTGGTC AGGCTGGTCT CCCACTCCCA 3 9780 

ACCTCAGGTG GTCCGCCTGC CTTAGCCTCC CAAAGTGCTG GGATGACAGG CGTGCAACCG 3 9840 

CGCCCAGCCT CTCTCTCTCT CTCTCTCTCT CTCGCTCGCT TGCTTGCTTG CTTTCGTGCT 3 9 900 

TTCTTGCTTT CCCGTTTTCT TGCTTTCTTT CTTTCTTTCG TTTCTTTCAT GCTTGCTTTC 3 996 0 

TTGCTTGCTT GCTTGCTTTC GTGCTTTCTT GCTTTCCTGT TTTCTTTCTT TCTTTCTTTC 4 0 020 

TTTCTTTCTT TTGTTTCTTT CTTGCTTGCT TTCTTGCTTG CTTGCTTGCT TTCGTGCTTT 4 00 80 

CTTGCTTTCC TGTTTTCTTT CTTTCTTTCT TTCTTTTCTT TCTTTCTTGC TTGCTTTCCT 4 014'" 

GCTTGCTTGC TTTCGTGCTT TCTTGTTTTC TCGATTTCTT TCTTTCTTTT GTTTCTTTCC 4 0200 

TGCTTGCTTT CTTGCTTGCT TGCTTTCGTG CTTCTTGCTT TCCTGTTTTC TTTCTTTCTT 4 0260 

TCTTTCTTTT GTTTCTTTCT TGCTTGCTTT CTTGCTTGCT TGCTTTCGTG CTGTCTTGTT 4 0 32'. 

TCTCGATTTC TTTCTTTCTT TTGTTTCTTT CCTGCTTGCT TTCTTGCTTG ATTGCTTTCG 4 038i 

TGCTTTCTTG CTTTCTTGTT TTCTTTCTTT CTTTTGTTTC TTTCTTTCTT GCTTCCTTGT 4 0- 1 -4' 

TTTCTTGCTT TCTTGCTTGC TTGCTTTCGT GCTTTCTTGT TTTCTTGCTT TCTTTCTTTT f .0^ 

GTTTCTTTCT TGCTTGCTTT CTTGCTTCCT TGTTTTCTTG CTTTCTTGCT TGCTTGCTTT 4 056 0 

CGTGCTTTCT TTCTTGCTTT CTTTTCTTTC TTTCTTTTCT TTTTCTTTCT TTCTTGCTTT 4 0 620 

CTTTTCTTTC AT CAT CATC T TTCTTTCTTT CCTTTCTTTC TTTCTTTCTT TCTATCTTTC 4 068<". 

TTTCTTTCTT TCTTTCTTTC TTTCTTTCTT TCTTTCTGTT TCGTCCTTTT GAGACAGAGT 4 0 74 - 

TTCACTCTTG TTTCCACGGC TAGAGTGCAA TGGCGCGATC TTGGCTCACC GCACCTTCCG 4 0 80O 

CCTCCCGGGT TCGAGCGCTT CTCCTGCCTC CAGCCTCCCG ATTAGCGGGG ATTGACAGGG 4 0 860 

AGGCACCCCC ACGCCTGGCT TGGCTGATGT TTGTGTTTTT AGTAGGCACG CCGTGTCTCT 4 0 920 

CCATGTTGCT CAGGCTGGTC TCCAACTCCC GACCTCCTGT GATGCGCCCA CCTCGGCCTC 4 0 980 

TCGAAGTGCT GGGATGACGG GCGTGACGAC CGTGCCCGGC CTGTTGACTC ATTTCGCTTT 4104 0 

TTTATTTCTT TCGTTTCCAC GCGTTTACTT ATATGTATTA ATGTAAACGT TTCTGTACGC 41100 

TTATATGCAA ACAACGACAA CGTGTATCTC TGCATTGAAT ACTCTTGCGT ATGGTAAATA 41160 

CGTATCGGTT GTATGGAAAT AGACTTCTGT ATGATAGATG TAGGTGTCTG TGTTATACAA 412 2 0 

ATAAATACAC ATCGCTCTAT AAAGAAGGGA TCGTCGATAA AGACGTTTAT TTTACGTATG 412 8 0 

AAAAGCGTCG TATTTATGTG TGTAAATGAA CCGAGCGTAC GTAGTTATCT CTGTTTTCTT 41340 

TCTTCCTCTC CTTCGTGTTT TTCTTCCTTC CTTTCTTCCT TTCTCTCCTT CTTTAGGTTT 414 0 0 

TTCTTCCTCT CTTCCTTTCC TTCTTTCTCT CTTTCTGTCC TTTTTTCCTT CGTGCTTTAT 414 6 0 

TTCTCTTTCG TTCCCTGTGT TTCCTTCTTT TTTCTTTCCT CTCTGTTTCT TTTTCCCTTC 4152 0 

TTTCCTTCGT TTCTTTCCTC ATTCTTTCTC TCTTTTTCGT TGTTTCTTTC CTTCCCGTCT 41580 

GTCTTTTAAA AAATTGGAGT GTTTCAGAAG TTTACTTTGT GTATCTACGT TTTCTAAATT 41640 

GTCTCTCTTT TCTCCATTTT CTTCCTCCCT CCCTCCCTCC CTCCCTGCTC CCTTCCCTCC 417 0 0 

CTCCTTCCCT TTCGCCATCT GTCTCTTTTC CCCACTCCCC TCCCCCCGTC TGTCTCTGCG 4176 0 

TGGATTCCGG AAGAGCCTAC CGATTCTGCC TCTCCGTGTG TCTGCAGCGA CCCCGCGACC 41820 

GAGTCCTTGT GTGTTCTTTC TCCCTCCCTC CCTCCCTCCC TCCCTCCCTC CCTCCCTGCT 41880 

TCCGAGAGGC ATCTCCAGAG ACCGCGCCGT GGGTTGTCTT CTGACTCTGT CGCGGTCGAG 41940 

GCAGAGACGC GTTTTGGGCA CCGTTTGTGT GGGGTTGGGG CAGAGGGGCT GCGTTTTCGG 42000 

CCTCGGGAAG AGCTTCTCGA CTCACGGTTT CGCTTTCGCG GTCCACGGGC CGCCCTGCCA 42060 

GCCGGATCTG TCTCGCTGAC GTCCGCGGCG GTTGTCGGGC TCCATCTGGC GGCCGCTTTG 4212 0 

AGATCGTGCT CTCGGCTTCC GGAGCTGCGG TGGCAGCTGC CGAGGGAGGG GACCGTCCCC 42 180 

GCTGTGAGCT AGGCAGAGCT CCGGAAAGCC CGCGGTCGTC AGCCCGGCTG GCCCGGTGGC 4224 0 

GCCAGAGCTG TGGCCGGTCG CTTGTGAGTC ACAGCTCTGG CGTGCAGGTT TATGTGGGGG 42 3 00 

AGAGGCTGTC GCTGCGCTTC TGGGCCCGCG GCGGGCGTGG GGCTGCCCGG GCCGGTCGAC 42 3 60 

CAGCGCGCCG TAGCTCCCGA GGCCCGAGCC GCGACCCGGC GGACCCGCCG CGCGTGGCGG 424 2 0 
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AGG 


"TGGGGA 


CG JCCTTCCC 


GG 2CCGGTCG 


CGGTCC' jCTC 


AT CCTG 






_*( 7 


1 1 1 i"( J A ( j(. 3 


C "< T ( . J 


*i * 


1 q r\ 
4 0 u 


CGG 


"CGAATT 


j r r r i _ ( _GA'j 


ATCCCCGTGG 


GGAGCCGGGG 


ACCGTC 


- 




.3"'.. 


I..' I 1 1 ..7 ' I 1 1 C 


.. ... „ 


1 ' 
■i . . 


) -i u 


CGG 


3TGCCGG 


GGAGCGGTCC 


gcgGgCgGgg 


LCIjLIjIj I cc^ 


TCTGCC 


j 




jA 


r T 1 -i ■ i r t l . T 1 , T I i 
1 L 1 1 1 






c" ri n 


CGA 


jTCCCCG 


TGGCCAGTCG 


GAGAGCGCTC 


CCTGAGCCGG 


TGCGGC 






ZA 


GAGGTCG 


2GC 


41 


660 


TGG 


TCGGCCT 


TCGGTCCCTC 


GTGTGTCCCG 


GTCGTAGGAG 


GGGCCG 


j 




ZG 


AAAATGC 


ft-:: 


41 


72 0 


CGG 


"TCCCGC 


TCTGGAGACA 


CGGGCCGGCC 


CCTGCGTGTG 


GCCAGG 


j 






rp^r^ ag 




4 : 


V M 0 


TCC 


:cggccc 


GGCGCTGTCC 


CCGCGTGTGT 


CCTTGGGTTG 


AC C AG A 


j 


j 


ZA 


CCCCGGG 


2GC 


4; 


840 


TCC 


3TGTGTG 


GCTGCGATGG 


TGGCGTTTTT 


GGGGACAGGT 


GTCCGT 




T 


ZC 


GTGTCGC 




4:. 


9 0 0 


TCG 


2CTGGGC 


CGGCGGCGTG 


GTCGGTGACG 


CGACCTCCCG 


GCCCCG 




j 


ZG 


AGGTATA 


tct 


4 2 


96 0 


TTC 


^CTCCGA 


GTCGGCAATT 


TTGGGCCGCC 


GGGTTATAT 














41 


99 9 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 
( i i i ) HYPOTHETICAL : 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 



Genomic DNA 
NO 



(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CTCCCGCGCG GCCCCCGTGT TCGCCGTTCC CGTGGCGCGG ACAATGCGGT TGTGCGTCCA 

CGTGTGCGTG TCCGTGCAGT GCCGTTGTGG AGTGCCTCGC TCTCCTCCTC CTCCCCGGCA 

GCGTTCCCAC GGTTGGGGAC CACCGGTGAC CTCGCCCTCT TCGGGCCTGG ATCCG 



6 0 

1 2 0 
17;, 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TO POLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
( i i i ) HYPOTHET ICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



GGTCTGGTGG 
GTCGTGCCCG 
GCGGCGTTGG 
TTCGGGGCCG 
GGCGGTGTGA 
GTTCGTGTCT 
CGGGGGACGT 
TCCCCTTCCC 
GCCCGGCCGT 
GGCGGCCACT 
TGCCCCCGCG 
GAAGGCTGCG 
GTCCTTCGTC 



GAATTGTTGA 
GCGCCGGACG 
TAGTCTCCCG 
GCGTTGCTTG 
TTCCCGCCGG 
CGGGAGCGGT 
TCCCGTCGCC 
CGTTTCGCCG 
GCTGCCGGAC 
GTGGTCCGGG 
GGCTCCCGTG 
CACGTTGTCG 
GTCCCGTCGG 



CCTCGCTCTC 
TGTGTCGGGG 
TGTTGCGTCT 
GCTTACGCAG 
TTTTGCCTCG 
GGTTTTTTTT 
CCCTGCCGCC 
TCGGTTCTCC 
CCCCCCTTCT 
AGCTGCTCGG 
GCCGACGCGG 
GTCCTTGCGA 
CGGTGGATCC 



GGGTGCGGCC 
CCCACTTCCC 
TCCCGGGCTC 
GCTTGGTTTG 
CGTCTGCCTG 
TTTTTCGGGT 
GGTGGGTTTT 
CCGGTCGGTC 
GGGGGGGATG 
CAGGCGGGTG 
CGTGTTCTTT 
GGGAAAGAGG 
GGCCT 



TTTGGGGAAC 
GCTCGAGGGT 
TTGGGGGGGG 
GGACTGCCTC 
CTTTGCCTCG 
CCCGGGGAGA 
CGTTTCGGGC 
GGCCCTCTCC 
CCCGGGCACG 
AGCCAGTTGG 
GGGGGGGCCT 
CTTTTTTTTT 



GGCGGGGTCG 
GGCGGTGGCG 
TGCCGTCGTT 
AGGAGTCGTG 
GGTTTGCTTG 
GGGGTTTTTC 
TGTGTTCGTT 
CCGGTCGGTC 
CACGCGTCCG 
AGGGGCGTCA 
GTGCGTGCGG 
TTAGGGGGTC 



60 
12 0 
180 
240 
3 0 0 
360 
420 
480 
54 0 
6 00 
660 
720 
755 



(2) INFORMATION FOR SEQ ID NO: 20: 

;i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 4 6 3 base pairs 

( B ) TYPE : nuc 1 e i c ac i d 

(C) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(11! MOLECULE TYPE: Genomic DNA 
(ill) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 0 : 



GGCCGAGGTG 
TAGCGGGTAC 
TTCTGGTTGT 
CAGGTCAGCC 
AGCGAGCCCG 
CCCCGGGGTT 
CTGGTTCCGG 
CGGGTCTCCC 



CGTCTGCGGG TTGGGGCTCG 
CGTCGCCGCG CCGAGGTGGG 
TGGCGGCGGG GGCTCCGGTC 
TCCGCCTGTG GGCTTCGTCG 
TCCGTTCGAC CTTCCTTCCG 
TTCACGGCGC CCCCCACGCT 
TCTCCCCGCC AAACCCCGGT 
AACCCCCGGC CGGAAGGGTT 



TCCGGCCCCG 
CGCACGTCGG 
GATGTCTTCC 
GCCGTCTCCC 
CCTTCCCCCC 
CCTCCGCCTC 
TGGGTTGGTC 
CGGGGGTTCC 



TCGTCCTCCG 
TGAGATAACC 
CCTCCCCCTC 
CCCCCCTCAC 
ATCTTTCCGC 
TCCGCCCGTG 
TCCGGCCCCG 
GGG 



GGAAGGCGTT 
CCGAGCGTGT 
TCCCCGAGGC 
GTCCCTCGCG 
GCTCCGTTGG 
GTTTGGACGC 
GCTTGCTCTT 



6 0 
120 
180 
240 
3 00 
36 0 
420 
463 



(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TY r PE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE 
( i i i ) HYPOTHETICAL 
(iv) ANT I SENSE : NO 
<v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE 



Genomic DNA 
NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 

GGATTCTTCA GGATTGAAAC CCAAACCGGT TCAGTTTCCT TTCCGGCTCC GGCCGGGGGG hG 

GGCGGCCCCG GGCGGTTTGG TGAGTTAGAT AACCTCGGGC CGATCGCACG CCCCCCGTGG 12 0 

CGGCGACGAC CCATTCGAAC GTCTGCCCTA TCAACTTTCG ATGGTAGTCG ATGTGCCTAC 1.8 0 

CATGGTGACC ACGGGTGACG GGGAATCAGG GTTCGATTCC GGAGAGGGAG CCTGAGAAAC 24 0 

GGCTACCACA TCCAAGGAAG GCAGCAGGCG CGCAAATTAC CCACTCCCGA CCCGGGGAGG 3 00 

TAGTGACGAA AAATAACAAT ACAGGACTCT TTCGAGGCCC TGTAATTGGA ATGAGTCCAC 3 60 

TTTAAATCCT TTAAGCAG 3 78 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 
( i i i ) HYPOTHETICAL : 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



Genomic DNA 
NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 



GATCCATTGG AGGGCAAGTC TGGTGCCAGC AGCCGCGGTA ATTCCAGCTC CAATAGCGTA 6 0 

TATTAAAGTT GCTGCAGTTA AAAAGCTCGT AGTTGGATCT TGGGAGCGGG CGGGCGGTCC 12 0 

GCCGCGAGGC GAGTCACCGC CCGTCCCCGC CCCTTGCCTC TCGGCGCCCC CTCGATGCTC 180 
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TTAGCTGAGT TG T C C C G CGG GGCCCGAAGO GTTTACTTTG AAAAAATTAG AGTTGTTTOA MO 

AAGCAGGCCC GAGOCGCCTG GATACCGCCA GOTAGGAAAT AATGGAATAG GAOOGCGGTT 3 00 

CCTATTTTGT TTGGTTTTCG GAACTGAGCG CATGATTAAG GGAAAOGGCC 1' ^'AT'F' J ,M' 

CCTTATTGClj CCCCCCTA i V 8 



(2) INFORMATION FOR SEQ ID NO : 2 3 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 719 base pairs 

(B) TYPE: nucleic acid 

( C ) S TRANDEDNE S S : s i ng 1 e 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE : 
( i i i ) HYPOTHETICAL : 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



Genomic DNA 
NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23; 



GGATCTTTCC 
TCCCCTCTCC 
GGGGGACCGC 
GCCGCGACCG 
GGCGCGTCTC 
TTTCGCGGAA 
CCCCGTCCGC 
GGTTTCTCTC 
GGACTGTCCT 
TCACGCCGCC 
GACCCGTCTT 
AAAGCCGCCG 



CGCTCCCCGT 
GGAGGGGGGG 
CCCCGGCCGG 
GCTACGAGAC 
AGGGCGCGCC 
TCCCGGGGCC 
CTCCCGGGCG 
TCTCCCGGTC 
CAGTGCGCCC 
CCCGACGAAG 
GAAACACGGA 
TGGCGCAATG 



TCCTCCCGGC 
GAGGTGGGGG 
CAAAAGGCCG 
GGCTGGGAAG 
GAACCACCTC 
GAGGGGAAGC 
GGCGTGGGGG 
TCGGCCGGTT 
CGGGCGTCGT 
CCGAGCGCAC 
CCAAGGAGTC 
AAGGTGAAGG 



CCCTCCACCC 
CGCGTGGGCG 
CCGCCGGGCG 
GCCCGACGGG 
ACCCCGAGTG 
CCGATACCCG 
TGGGGGCCGG 
TGGGGGGGGG 
CGCGCCGTCG 
GGGGTCGGCG 
TAACGCGTGC 
GCCCCGTCCG 



GCGCGTCTCC 
GGGTCGGGGG 
CACTTCAACC 
GAATGTGGCT 
TTACAGCCCT 
TCGCCGCGCT 
GCCGCCCCTC 
AGCCCGGTTG 
GGCCCGGGGG 
GCGATGTCGG 
GCGAGTCAGG 
GGGGCCCGAG 



CCCCTTCTTT 
TGGGGTCGGC 
GTAGCGGTGC 
CGGGGGGGGC 
CCGGCCGCGC 
TTTCCCCTCC 
CCACGCCCGT 
GGGGCGGGGC 
GTTCTCTCGG 
CTACCCACCC 
GGCTCGCAOG 



6 0 
120 
180 
M0 
\n 0 
I h 0 
420 
480 
S4 0 
LUC 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



CGAGGCCTCT 
AGGTGGAGCA 
CGAAGCCAGA 
CGACCTGGGT 
TTTCCCTCAG 
CGGAATGGAT 
ATGGGTAAGG 
GGCCACTTTT 
CCGATGCCGA 
TGGCCATGGA 
GCCCTGAAAA 
AACGGGACGG 



CCAGTCCGCC 
CGAGCGTACG 
GGAAACTCTG 
ATAGGGGCGA 
GATAGCTGGC 
TAGGAGGTCT 
AAGCCCGGCT 
GGTAAGCAGA 
CGCTCATCAG 
AGTCGGAATC 
TGGATGGCGC 
GACGGGAGCG 



GAGGGCGCAC 
CGTTAGGACC 
GTGGAGGTCC 
AAGACTAATC 
GCTCTCGCAA 
TGGGGCCGGA 
CGCTGGCGTG 
ACTGGCGCTG 
AC C C C AG AAA 
CGCTAAGGAG 
TGGAGCGTCG 
GCCGC 



CACCGGCCCG 
CGAAAGATGG 
GTAGCGGTCC 
GAACCATCTA 
CCTTCGGAAG 
AACGATCTCA 
GAGCCGGGCG 
CGGGATGAAC 
AGGTGTTGGT 
TGTGTAACAA 
GGCCCATACC 



TCTCGCCCGC 
TGAACTATGC 
TGACGTGCAA 
GTAGCTGGTT 
CAGTTTTATC 
AACTATTTCT 
TGGAATGCGA 
CGAACGCCGG 
TGATATAGAC 
CTCACCTGCC 
CGGCCGTCGC 



CGCGTCGGGG 
CTGGGCAGGG 
ATCGGTCGTC 
CCCTCCGAAG 
CGGGTAAAGG 
CAAACTTTAA 
GTGCCTAGTG 
GTTAAGGCGC 
AGCAGGACGG 
GAATCAACTA 
CGGCAGTCGG 



•50 

120 
180 
240 
3 00 
3 60 
420 
480 
540 
600 
660 
685 



(2) INFORMATION FOR SEQ ID NO : 2 5 : 
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{ l ) SEQUENCE CHARACTER I ST1CS : 

(A) LENGTH : 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
( i i i ) HYPOTHETICAL : NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE : 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 5 : 
GAGGAATTCC CCTATCCCTA AT C C AG AT T G GTG 3 3 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
( i i i ) HYPOTHETICAL : NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

AAACTGCAGG CCGAGCCACC TCTCTTCTGT GTTTG 

(2) INFORMATION FOR SEQ ID NO : 2 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGGAATTCAC AGAAGAGAGG TGGCTCGGCC TGC 33 
(2) INFORMATION FOR SEQ ID NO : 2 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(XI) 



SEQUENCE DESCRT PTION : SKf) IlJ NO : 2H : 



AGCCTGCAGG AAGTCATACC TGGGGAGGTG GCCG 



(2) INFORMATION FOR SEQ ID NO : 2 9 : 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 8 0 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
( i i i ) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 

AAACTGCAGG TTAATTAACC CTAACCCTAA CCCTAACCCT AACCCTAACC CTAACCCTAA 6 0 

CCCTAACCCT AACCCGGGAT 8 0 

(2) INFORMATION FOR SEQ ID NO : 3 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
( i i i ) HYPOTHETICAL : NO 

( iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TTGGGCCCTA GGCTTAAGG 19 
(2) INFORMATION FOR SEQ ID NO : 3 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 
( i i i ) HYPOTHET I CAL : NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 
GCCAGGGTTT TCCCAGTCAC GACGT 2 5 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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( ]_) ) TO PC) LOGY : linear 



( i 1 j MOLECULE TYPE : Genomic UNA 
(111) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
(vij ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GCTGCAAGGC GATTAAGTTG GGTAAC 2 6 

(2) INFORMATION FOR SEQ ID NO : 3 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TY r PE : Genomic DNA 
( i i i ) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 3 : 
TATGTTGTGT GGAATTGTGA GCGGAT 2 6 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 
( iii ) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGGTTTAAAC AGATCTCTGC A 21 
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